PATENT APPLICATION

Attorney Docket No. 15966-697 (Cura-197) 



NOVEL PROTEINS AND NUCLEIC ACIDS ENCODING SAME 



RELATED APPLICATIONS 

This application claims priority from USSN 60/186,592, filed March 3, 2000; USSN
60/186,718, filed March 3, 2000; USSN 60/187,293, filed March 6, 2000; USSN 60/187,294,
filed March 6, 2000; USSN 60/190,400, filed March 17, 2000; USSN 60/196,018, filed April 7,
2000; and USSN 60/259,548, filed January 3, 2001; each of which is incorporated by reference in its
entirety.

BACKGROUND OF THE INVENTION 

The invention relates generally to polynucleotides and polypeptides, as well as vectors,
host cells, antibodies, and recombinant methods for producing these nucleic acids and
polypeptides.

SUMMARY OF THE INVENTION 

The invention is based in part upon the discovery of novel nucleic acid sequences
encoding novel polypeptides. The disclosed FCTR1, FCTR2, FCTR3, FCTR4, FCTR5, FCTR6
and FCTR7 nucleic acids and polypeptides encoded therefrom, as well as derivatives, homologs,
analogs and fragments thereof, will hereinafter be collectively designated as "FCTRX" nucleic
acid or polypeptide sequences.

In one aspect, the invention provides an isolated FCTRX nucleic acid molecule encoding
a FCTRX polypeptide that includes a nucleic acid sequence that has identity to the nucleic acids
disclosed in SEQ ID NOS:1, 3, 5, 7, 9, 10, 11, 12, 14, 16, 18, 20, 22, and 24. In some
embodiments, the FCTRX nucleic acid molecule will hybridize under stringent conditions to a
nucleic acid sequence complementary to a nucleic acid molecule that includes a protein-coding
sequence of a FCTRX nucleic acid sequence. The invention also includes an isolated nucleic
acid that encodes a FCTRX polypeptide, or a fragment, homolog, analog or derivative thereof.
For example, the nucleic acid can encode a polypeptide at least 80% identical to a polypeptide
comprising the amino acid sequences of SEQ ID NOS:2, 4, 6, 8, 13, 15, 17, 19, 21, 23, and 25.
The nucleic acid can be, for example, a genomic DNA fragment or a cDNA molecule that
includes the nucleic acid sequence of any of SEQ ID NOS:1, 3, 5, 7, 9, 10, 11, 12, 14, 16, 18,
20, 22, and 24.

Also included in the invention is an oligonucleotide, e.g., an oligonucleotide which
includes at least 6 contiguous nucleotides of a FCTRX nucleic acid (e.g., SEQ ID NOS:1, 3, 5,
7, 9, 10, 11, 12, 14, 16, 18, 20, 22, and 24) or a complement of said oligonucleotide.

Also included in the invention are substantially purified FCTRX polypeptides (SEQ ID
NOS:2, 4, 6, 8, 13, 15, 17, 19, 21, 23, and 25). In certain embodiments, the FCTRX polypeptides
include an amino acid sequence that is substantially identical to the amino acid sequence of a
human FCTRX polypeptide.

The invention also features antibodies that immunoselectively bind to FCTRX
polypeptides, or fragments, homologs, analogs or derivatives thereof.

In another aspect, the invention includes pharmaceutical compositions that include
therapeutically- or prophylactically-effective amounts of a therapeutic and a
pharmaceutically-acceptable carrier. The therapeutic can be, e.g., a FCTRX nucleic acid, a FCTRX polypeptide,
or an antibody specific for a FCTRX polypeptide. In a further aspect, the invention includes, in
one or more containers, a therapeutically- or prophylactically-effective amount of this
pharmaceutical composition.

In a further aspect, the invention includes a method of producing a polypeptide by
culturing a cell that includes a FCTRX nucleic acid, under conditions allowing for expression of
the FCTRX polypeptide encoded by the DNA. If desired, the FCTRX polypeptide can then be
recovered.

In another aspect, the invention includes a method of detecting the presence of a FCTRX
polypeptide in a sample. In the method, a sample is contacted with a compound that selectively
binds to the polypeptide under conditions allowing for formation of a complex between the
polypeptide and the compound. The complex is detected, if present, thereby identifying the
FCTRX polypeptide within the sample.

The invention also includes methods to identify specific cell or tissue types based on their 
expression of a FCTRX. 

Also included in the invention is a method of detecting the presence of a FCTRX nucleic
acid molecule in a sample by contacting the sample with a FCTRX nucleic acid probe or primer,
and detecting whether the nucleic acid probe or primer bound to a FCTRX nucleic acid molecule
in the sample.

In a further aspect, the invention provides a method for modulating the activity of a
FCTRX polypeptide by contacting a cell sample that includes the FCTRX polypeptide with a
compound that binds to the FCTRX polypeptide in an amount sufficient to modulate the activity
of said polypeptide. The compound can be, e.g., a small molecule, such as a nucleic acid,
peptide, polypeptide, peptidomimetic, carbohydrate, lipid or other organic (carbon containing) or
inorganic molecule, as further described herein.

Also within the scope of the invention is the use of a Therapeutic in the manufacture of a
medicament for treating or preventing disorders or syndromes including, e.g., Colorectal cancer,
adenomatous polyposis coli, myelogenous leukemia, congenital neonatal alloimmune
thrombocytopenia, multiple human solid malignancies, malignant ovarian tumours particularly at
the interface between epithelia and stroma, malignant brain tumors, mammary tumors, human
gliomas, astrocytomas, mixed glioma/astrocytomas, renal cell carcinoma, breast
adenocarcinoma, ovarian cancer, melanomas, renal cell carcinoma, clear cell and granular cell
carcinomas, autocrine/paracrine stimulation of tumor cell proliferation, autocrine/paracrine
stimulation of tumor cell survival and tumor cell resistance to cytotoxic therapy, parenchymal
and basement membrane invasion and motility of tumor cells thereby contributing to metastasis,
tumor-mediated immunosuppression of T-cell mediated immune effector cells and pathways
resulting in tumor escape from immune surveillance, neurological disorders, neurodegenerative
disorders, nerve trauma, familial myelodysplastic syndrome, Charcot-Marie-Tooth neuropathy,
demyelinating Gardner syndrome, familial myelodysplastic syndrome; mental health conditions,
immunological disorders, allergy and infection, asthma, bronchial asthma, Avellino type
eosinophilic lung diseases, reproductive disorders, male infertility, female reproductive system
disorders, male and female reproductive diseases, hemangioma, deafness, glycoprotein Ia
deficiency, desmoid disease, Turcot syndrome, liver cirrhosis, hepatitis C, gastric disorders,
pancreatic diseases like diabetes, Schistosoma mansoni infection, Spinocerebellar ataxia,
Plasmodium falciparum parasitemia, Corneal dystrophy - Groenouw type I, Corneal dystrophy -
lattice type I, and Reis-Bucklers corneal dystrophy. The Therapeutic can be, e.g., a FCTRX
nucleic acid, a FCTRX polypeptide, or a FCTRX-specific antibody, or biologically-active
derivatives or fragments thereof.

The invention further includes a method for screening for a modulator of disorders or
syndromes including, e.g., Colorectal cancer, adenomatous polyposis coli, myelogenous
leukemia, congenital neonatal alloimmune thrombocytopenia, multiple human solid
malignancies, malignant ovarian tumours particularly at the interface between epithelia and
stroma, malignant brain tumors, mammary tumors, human gliomas, astrocytomas, mixed
glioma/astrocytomas, renal cell carcinoma, breast adenocarcinoma, ovarian cancer, melanomas,
renal cell carcinoma, clear cell and granular cell carcinomas, autocrine/paracrine stimulation of
tumor cell proliferation, autocrine/paracrine stimulation of tumor cell survival and tumor cell
resistance to cytotoxic therapy, parenchymal and basement membrane invasion and motility of
tumor cells thereby contributing to metastasis, tumor-mediated immunosuppression of T-cell
mediated immune effector cells and pathways resulting in tumor escape from immune
surveillance, neurological disorders, neurodegenerative disorders, nerve trauma, familial
myelodysplastic syndrome, Charcot-Marie-Tooth neuropathy, demyelinating Gardner syndrome,
familial myelodysplastic syndrome; mental health conditions, immunological disorders, allergy
and infection, asthma, bronchial asthma, Avellino type eosinophilia, lung diseases, reproductive
disorders, male infertility, female reproductive system disorders, male and female reproductive
diseases, hemangioma, deafness, glycoprotein Ia deficiency, desmoid disease, Turcot syndrome,
liver cirrhosis, hepatitis C, gastric disorders, pancreatic diseases like diabetes, Schistosoma
mansoni infection, Spinocerebellar ataxia, Plasmodium falciparum parasitemia, Corneal
dystrophy - Groenouw type I, Corneal dystrophy - lattice type I, and Reis-Bucklers corneal
dystrophy. The method includes contacting a test
compound with a FCTRX polypeptide and determining if the test compound binds to said
FCTRX polypeptide. Binding of the test compound to the FCTRX polypeptide indicates the test
compound is a modulator of activity, or of latency or predisposition to the aforementioned
disorders or syndromes.

Also within the scope of the invention is a method for screening for a modulator of
activity, or of latency or predisposition to disorders or syndromes including, e.g., Colorectal
cancer, adenomatous polyposis coli, myelogenous leukemia, congenital neonatal alloimmune
thrombocytopenia, multiple human solid malignancies, malignant ovarian tumours particularly
at the interface between epithelia and stroma, malignant brain tumors, mammary tumors, human
gliomas, astrocytomas, mixed glioma/astrocytomas, renal cell carcinoma, breast adenocarcinoma,
ovarian cancer, melanomas, renal cell carcinoma, clear cell and granular cell carcinomas,
autocrine/paracrine stimulation of tumor cell proliferation, autocrine/paracrine stimulation of
tumor cell survival and tumor cell resistance to cytotoxic therapy, parenchymal and basement
membrane invasion and motility of tumor cells thereby contributing to metastasis,
tumor-mediated immunosuppression of T-cell mediated immune effector cells and pathways resulting
in tumor escape from immune surveillance, neurological disorders, neurodegenerative disorders,
nerve trauma, familial myelodysplastic syndrome, Charcot-Marie-Tooth neuropathy,
demyelinating Gardner syndrome, familial myelodysplastic syndrome; mental health conditions,
immunological disorders, allergy and infection, asthma, bronchial asthma, Avellino type
eosinophilia, lung diseases, reproductive disorders, male infertility, female reproductive system
disorders, male and female reproductive diseases, hemangioma, deafness, glycoprotein Ia
deficiency, desmoid disease, Turcot syndrome, liver cirrhosis, hepatitis C, gastric disorders,
pancreatic diseases like diabetes, Schistosoma mansoni infection, Spinocerebellar ataxia,
Plasmodium falciparum parasitemia, Corneal dystrophy - Groenouw type I, Corneal dystrophy -
lattice type I, and Reis-Bucklers corneal dystrophy by administering a test compound to a test
animal at increased risk for the aforementioned disorders or syndromes. The test animal
expresses a recombinant polypeptide encoded by a FCTRX nucleic acid. Expression or activity
of FCTRX polypeptide is then measured in the test animal, as is expression or activity of the
protein in a control animal which recombinantly expresses FCTRX polypeptide and is not at
increased risk for the disorder or syndrome. Next, the expression of FCTRX polypeptide in both
the test animal and the control animal is compared. A change in the activity of FCTRX
polypeptide in the test animal relative to the control animal indicates the test compound is a
modulator of latency of the disorder or syndrome.

In yet another aspect, the invention includes a method for determining the presence of or
predisposition to a disease associated with altered levels of a FCTRX polypeptide, a FCTRX
nucleic acid, or both, in a subject (e.g., a human subject). The method includes measuring the
amount of the FCTRX polypeptide in a test sample from the subject and comparing the amount
of the polypeptide in the test sample to the amount of the FCTRX polypeptide present in a
control sample. An alteration in the level of the FCTRX polypeptide in the test sample as
compared to the control sample indicates the presence of or predisposition to a disease in the
subject. Preferably, the predisposition is to a disorder or syndrome including, e.g., Colorectal
cancer, adenomatous polyposis coli, myelogenous leukemia, congenital neonatal alloimmune
thrombocytopenia, multiple human solid malignancies, malignant ovarian tumours particularly
at the interface between epithelia and stroma, malignant brain tumors, mammary tumors, human
gliomas, astrocytomas, mixed glioma/astrocytomas, renal cell carcinoma, breast adenocarcinoma,
ovarian cancer, melanomas, renal cell carcinoma, clear cell and granular cell carcinomas,
autocrine/paracrine stimulation of tumor cell proliferation, autocrine/paracrine stimulation of
tumor cell survival and tumor cell resistance to cytotoxic therapy, parenchymal and basement
membrane invasion and motility of tumor cells thereby contributing to metastasis,
tumor-mediated immunosuppression of T-cell mediated immune effector cells and pathways resulting
in tumor escape from immune surveillance, neurological disorders, neurodegenerative disorders,
nerve trauma, familial myelodysplastic syndrome, Charcot-Marie-Tooth neuropathy,
demyelinating Gardner syndrome, familial myelodysplastic syndrome; mental health conditions,
immunological disorders, allergy and infection, asthma, bronchial asthma, Avellino type
eosinophilic lung diseases, reproductive disorders, male infertility, female reproductive system
disorders, male and female reproductive diseases, hemangioma, deafness, glycoprotein Ia
deficiency, desmoid disease, Turcot syndrome, liver cirrhosis, hepatitis C, gastric disorders,
pancreatic diseases like diabetes, Schistosoma mansoni infection, Spinocerebellar ataxia,
Plasmodium falciparum parasitemia, Corneal dystrophy - Groenouw type I, Corneal dystrophy -
lattice type I, and Reis-Bucklers corneal dystrophy. Also, the expression levels of the new
polypeptides of the invention can be used in a method to screen for various cancers as well as
to determine the stage of cancers.

In a further aspect, the invention includes a method of treating or preventing a
pathological condition associated with a disorder in a mammal by administering to a subject
(e.g., a human subject) a FCTRX polypeptide, a FCTRX nucleic acid, or a FCTRX-specific
antibody in an amount sufficient to alleviate or prevent the pathological condition. In
preferred embodiments, the disorder includes, e.g., Colorectal cancer, adenomatous polyposis
coli, myelogenous leukemia, congenital neonatal alloimmune thrombocytopenia, multiple human
solid malignancies, malignant ovarian tumours particularly at the interface between epithelia and
stroma, malignant brain tumors, mammary tumors, human gliomas, astrocytomas, mixed
glioma/astrocytomas, renal cell carcinoma, breast adenocarcinoma, ovarian cancer, melanomas,
renal cell carcinoma, clear cell and granular cell carcinomas, autocrine/paracrine stimulation of
tumor cell proliferation, autocrine/paracrine stimulation of tumor cell survival and tumor cell
resistance to cytotoxic therapy, parenchymal and basement membrane invasion and motility of
tumor cells thereby contributing to metastasis, tumor-mediated immunosuppression of T-cell
mediated immune effector cells and pathways resulting in tumor escape from immune
surveillance, neurological disorders, neurodegenerative disorders, nerve trauma, familial
myelodysplastic syndrome, Charcot-Marie-Tooth neuropathy, demyelinating Gardner syndrome,
familial myelodysplastic syndrome; mental health conditions, immunological disorders, allergy
and infection, asthma, bronchial asthma, Avellino type eosinophilia, lung diseases, reproductive
disorders, male infertility, female reproductive system disorders, male and female reproductive
diseases, hemangioma, deafness, glycoprotein Ia deficiency, desmoid disease, Turcot syndrome,
liver cirrhosis, hepatitis C, gastric disorders, pancreatic diseases like diabetes, Schistosoma
mansoni infection, Spinocerebellar ataxia, Plasmodium falciparum parasitemia, Corneal
dystrophy - Groenouw type I, Corneal dystrophy - lattice type I, and Reis-Bucklers corneal
dystrophy.

In yet another aspect, the invention can be used in a method to identify the cellular
receptors and downstream effectors of the invention by any one of a number of techniques
commonly employed in the art. These include but are not limited to the two-hybrid system,
affinity purification, and co-precipitation with antibodies or other specific-interacting molecules.

Unless otherwise defined, all technical and scientific terms used herein have the same
meaning as commonly understood by one of ordinary skill in the art to which this invention
belongs. Although methods and materials similar or equivalent to those described herein can be
used in the practice or testing of the present invention, suitable methods and materials are
described below. All publications, patent applications, patents, and other references mentioned
herein are incorporated by reference in their entirety. In the case of conflict, the present
specification, including definitions, will control. In addition, the materials, methods, and
examples are illustrative only and not intended to be limiting.

Other features and advantages of the invention will be apparent from the following 
detailed description and claims. 

DETAILED DESCRIPTION 

The invention is based, in part, upon the discovery of novel nucleic acid sequences that
encode novel polypeptides. The novel nucleic acids and their encoded polypeptides are referred
to individually as FCTR1, FCTR2, FCTR3, FCTR4, FCTR5, FCTR6, and FCTR7. The nucleic
acids, and their encoded polypeptides, are collectively designated herein as "FCTRX".

The novel FCTRX nucleic acids of the invention include the nucleic acids whose
sequences are provided in Tables 1A, 2A, 3A, 3C, 3E, 3F, 3G, 3H, 4A, 5A, 5C, 5E, 6A, 6C, and
7A inclusive ("Tables 1A - 7A"), or a fragment, derivative, analog or homolog thereof. The
novel FCTRX proteins of the invention include the protein fragments whose sequences are
provided in Tables 1B, 2B, 3B, 3I, 4B, 5B, 5D, 6B, 6D, and 7B inclusive ("Tables 1B - 7B").
The individual FCTRX nucleic acids and proteins are described below. Within the scope of this
invention is a method of using these nucleic acids and peptides in the treatment or prevention of
a disorder related to cell signaling or metabolic pathway modulation.






FCTR1 

Novel FCTR1 is a growth factor ("FCTR") protein related to the follistatin-like gene and
mac25. FCTR1 (also referred to by proprietary accession number 58092213.0.36) is a full-length
clone of 771 nucleotides, including the entire coding sequence of a 105 amino acid protein from
nucleotides 438 to 753. The clone was originally obtained from thyroid gland, kidney, fetal
kidney, and spleen tissues.

The nucleotide sequence of FCTR1 as presently determined is reported in Table 1A. The
start and stop codons are bolded and the 5' and 3' untranslated regions are underlined.

Table 1A. FCTR1 nucleotide sequence (SEQ ID NO:1).

GGTCCTCACCCCCTTCCTCTCTCCGAGCCTCGGTGTCTGGTTACGGCTCCTCTGCTCGCATTGTGACTTTGGGCCAGGCTGGGGGA
AATGACCCGGGAGGGTCCCATGCGGCTACATAAAATTGGCAGGCTTAGAACTAGTGGGAAGGCGGGTGCGCGAAGTCGAGGGGCGG 
AGAGAGGGGGCCGGAGGAGCTGCTTTCTGAATCCAAGTTCGTGGGCTCTCTCAGAAGTGCTCAGGACGGAGCAGAGGTGGCCGGCG 
GGCCCGGCTGACTGCGCCTCTGCTTTCTTTCCATAACCTTTTCTTTCGGACTCGAATCACGGCTGCTGCGAAGGGTCTAGTTCCGG 
ACACTAGGGCCCCAGATCGTGTCACATCCATATGACACTTGGAATGTGACAGGGCAGGATGTGATCTTTGGCTGTGAAGTGTTTGC 

CTACCCCATGGCCTCCATCGAGTGGAGGAAGGATGGCTTGGACATCCAGCTGCCAGGGGATGACCCCCACATCTCTGTGCAGTTTA

GGGGTGGACCCCAGAGGTTTGAGGTGACTGGCTGGCTGCAGATCCAGGCTGTGCGTCCCAGTGATGAGGGCACTTACCGCTGCCTT 
GCCCGCAATGCCCTGGGTCAAGTGGAGGCCCCTGCTAGCTTGACAGTGCTCACACCTGACCAGCTGAACTCTACAGGCATCCCCCA 
rtPTnraATParTaAarrTanTTrrTnA^nAnnA^CTGAGAGTGAAGAGAATGACGATTACTACTAG GTCCAGAGCTCTGGCC 

The predicted amino acid sequence of FCTR1 protein corresponding to the foregoing
nucleotide sequence is reported in Table 1B. FCTR1 was searched against other databases using
SignalPep and PSort search protocols. The protein is most likely located in the cytoplasm
(certainty=0.6500) and seems to have no N-terminal signal sequence. The predicted molecular
weight of FCTR1 protein is 11711.8 daltons.

Table 1B. Encoded FCTR1 protein sequence (SEQ ID NO:2).

MASIEWRKDGLDIQLPGDDPHISVQFRGGPQRFEVTGWLQ
SLNLVPEEEAESEENDDYY

FCTR1 was initially identified with a TblastN analysis of a proprietary sequence file for a
follistatin-like probe or homolog which was run against the Genomic Daily Files made available
by GenBank. A proprietary software program (GenScan™) was used to further predict the
nucleic acid sequence and the selection of exons. The resulting sequences were further modified
by means of similarities using BLAST searches. The sequences were then manually corrected
for apparent inconsistencies, thereby obtaining the sequences encoding the full-length protein.

In an analysis of sequence databases, it was found, for example, that the FCTR1 amino
acid sequence has 31/71 residues (43%) identical and 46/71 residues (64%) positive to a Mus
musculus IGFBP-like protein (TREMBL Accession Number:BAA21725) shown in Table 1C.
In all BLAST alignments herein, the "E-value" or "Expect" value is a numeric indication of the
probability that the aligned sequences could have achieved their similarity to the BLAST query
sequence by chance alone, within the database that was searched. For example, as shown in
Table 1C, the probability that the subject ("Sbjct") retrieved from the FCTR1 BLAST analysis,
in this case the Mus musculus IGFBP-like protein, matched the Query FCTR1 sequence purely
by chance is 1.2 x 10^-11.
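
The relationship between the EXPECT and P values printed in the BLAST tables herein can be illustrated numerically. The following is a minimal sketch, assuming the usual Poisson approximation for chance alignments, P = 1 - exp(-E); the function name `evalue_to_pvalue` is illustrative only and is not part of any BLAST program.

```python
import math


def evalue_to_pvalue(e_value: float) -> float:
    """Probability of at least one chance hit, assuming chance hits
    follow a Poisson distribution whose mean is the E-value."""
    # -expm1(-E) equals 1 - exp(-E) but stays accurate for very small E.
    return -math.expm1(-e_value)


# For a very small E-value the two numbers are practically identical
# (compare EXPECT = 1.2E-11, P = 1.2E-11 in Table 1C).
p_small = evalue_to_pvalue(1.2e-11)

# For a weaker alignment they diverge
# (compare EXPECT = 0.25, P = 0.22 reported for one alignment herein).
p_large = evalue_to_pvalue(0.25)

print(f"{p_small:.1e} {p_large:.2f}")
```

Under this assumption, the E-value and the P value can be read interchangeably for the strong alignments reported here, and only the weaker alignments show a visible difference between the two.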

Table 1C. BLASTP of FCTR1 against Mus musculus IGFBP-like protein (SEQ ID NO:38)

PTNR:REMTREMBL-ACC:BAA21725 IGFBP-LIKE PROTEIN - MUS MUSCULUS (MOUSE), 270 AA.

LENGTH = 270

SCORE = 161 (56.7 BITS), EXPECT = 1.2E-11, P = 1.2E-11
IDENTITIES = 31/71 (43%), POSITIVES = 46/71 (64%)

QUERY:   9 DGLDIQLPGDDPHISVQFRGGPQRFEVTGWLQIQAVRPSDEGTYRCLARNALGQVEAPAS 68
SBJCT: 191 EGLE-ELPGDHVNIAVQVRGGPSDHETTSWILINPLRKEDEGVYHCHAANAIGEAQSHGT 249

QUERY:  69 LTVLTPDQLNS 79
SBJCT: 250 VTVLDLNRYKS 260

The amino acid sequence of FCTR1 also had 26/58 residues (44%) identical, and 38/58
residues (65%) positive for Mus musculus Follistatin-like Protein shown in Table 1D.



Table 1D. BLASTP of FCTR1 against Mus musculus Follistatin-like Protein (SEQ ID NO:39)

PTNR:SPTREMBL-ACC:Q61581 FOLLISTATIN-LIKE 2 (FOLLISTATIN-LIKE PROTEIN) - MUS MUSCULUS
(MOUSE), 238 AA.

LENGTH = 238

SCORE = 149 (52.5 BITS), EXPECT = 1.5E-10, P = 1.5E-10
IDENTITIES = 26/58 (44%), POSITIVES = 38/58 (65%)

QUERY:  15 LPGDDPHISVQFRGGPQRFEVTGWLQIQAVRPSDEGTYRCLARNALGQVEAPASLTVL 72
SBJCT: 165 LPGDRENLAIQTRGGPEKHEVTGWVLVSPLSKEDAGEYECHASNSQGQASAAAKITVV 222

The amino acid sequence of FCTR1 also had 26/58 residues (44%) identical, and 38/58
residues (65%) positive for Homo sapiens MAC25 protein shown in Table 1E.



Table 1E. BLASTP of FCTR1 against Homo sapiens MAC25 protein (SEQ ID NO:40)

PTNR:SPTREMBL-ACC:Q07822 MAC25 PROTEIN - HOMO SAPIENS (HUMAN), 277 AA.

LENGTH = 277

SCORE = 149 (52.5 BITS), EXPECT = 3.2E-10, P = 3.2E-10
IDENTITIES = 26/58 (44%), POSITIVES = 38/58 (65%)

QUERY:  15 LPGDDPHISVQFRGGPQRFEVTGWLQIQAVRPSDEGTYRCLARNALGQVEAPASLTVL 72
SBJCT: 209 LPGDRDNLAIQTRGGPEKHEVTGWVLVSPLSKEDAGEYECHASNSQGQASASAKITVV 266
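
The IDENTITIES figure in a BLASTP report of this kind can be recomputed directly from the two aligned rows. The following is a minimal sketch using the Query and Sbjct rows of Table 1E; the helper name `identity_stats` is hypothetical, and the sketch assumes that the percentages in the tables are truncated rather than rounded (26/58 is 44.8%, reported as 44%). The POSITIVES figure is not recomputed because it depends on the substitution matrix used for the search, which is not named here.

```python
def identity_stats(query: str, subject: str) -> tuple[int, int, int]:
    """Return (identical columns, alignment length, truncated percent identity)
    for two aligned rows of equal length, where '-' marks a gap."""
    if len(query) != len(subject):
        raise ValueError("aligned rows must have the same number of columns")
    # A column is identical when both rows carry the same residue; gaps never match.
    identical = sum(q == s and q != "-" for q, s in zip(query, subject))
    length = len(query)
    # Integer division truncates, matching the assumed reporting convention.
    return identical, length, identical * 100 // length


# Aligned rows from Table 1E (FCTR1 residues 15-72, MAC25 residues 209-266).
query = "LPGDDPHISVQFRGGPQRFEVTGWLQIQAVRPSDEGTYRCLARNALGQVEAPASLTVL"
subject = "LPGDRDNLAIQTRGGPEKHEVTGWVLVSPLSKEDAGEYECHASNSQGQASASAKITVV"

identical, length, percent = identity_stats(query, subject)
print(f"IDENTITIES = {identical}/{length} ({percent}%)")
```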




The amino acid sequence of FCTR1 also had 26/58 residues (44%) identical, and 38/58
residues (65%) positive for Mus musculus MAC25 protein shown in Table 1F.

Table 1F. BLASTP of FCTR1 against Mus musculus MAC25 protein (SEQ ID NO:41)

PTNR:SPTREMBL-ACC:O88812 MAC25 - MUS MUSCULUS (MOUSE), 281 AA.

LENGTH = 281

SCORE = 149 (52.5 BITS), EXPECT = 3.4E-10, P = 3.4E-10
IDENTITIES = 26/58 (44%), POSITIVES = 38/58 (65%)

QUERY:  15 LPGDDPHISVQFRGGPQRFEVTGWLQIQAVRPSDEGTYRCLARNALGQVEAPASLTVL 72
SBJCT: 208 LPGDRENLAIQTRGGPEKHEVTGWVLVSPLSKEDAGEYECHASNSQGQASAAAKITVV 265

The amino acid sequence of FCTR1 also had 26/58 residues (44%) identical, and 38/58
residues (65%) positive for Homo sapiens Prostacyclin-stimulating factor shown in Table 1G.

Table 1G. BLASTP of FCTR1 against Homo sapiens Prostacyclin-stimulating factor (SEQ ID NO:42)

PTNR:SPTREMBL-ACC:Q16270 PROSTACYCLIN-STIMULATING FACTOR - HOMO SAPIENS (HUMAN), 282 AA.

LENGTH = 282

SCORE = 149 (52.5 BITS), EXPECT = 3.4E-10, P = 3.4E-10
IDENTITIES = 26/58 (44%), POSITIVES = 38/58 (65%)

QUERY:  15 LPGDDPHISVQFRGGPQRFEVTGWLQIQAVRPSDEGTYRCLARNALGQVEAPASLTVL 72
SBJCT: 209 LPGDRDNLAIQTRGGPEKHEVTGWVLVSPLSKEDAGEYECHASNSQGQASASAKITVV 266

The amino acid sequence of FCTR1 also had 18/44 residues (40%) identical, and 25/44
residues (56%) positive for rat Colorectal cancer suppressor shown in Table 1H.



Table 1H. BLASTP of FCTR1 against rat Colorectal cancer suppressor (SEQ ID NO:43)

PTNR:PIR-ID:B40098 COLORECTAL CANCER SUPPRESSOR DCC - RAT (FRAGMENTS)

LENGTH = 144

SCORE = 78 (27.5 BITS), EXPECT = 1.1E-05, SUM P(2) = 1.1E-05
IDENTITIES = 18/44 (40%), POSITIVES = 25/44 (56%)

QUERY:  33 FEVTGW--LQIQAVRPSDEGTYRCLARNALGQVEAPASLTVLTP 74
SBJCT: 101 FQIVGGSNLRILGVVKSDEGFYQCVAENEAGNAQSSAQLIVPKP 144

SCORE = 37 (13.0 BITS), EXPECT = 1.1E-05, SUM P(2) = 1.1E-05
IDENTITIES = 8/19 (42%), POSITIVES = 12/19 (63%)

QUERY:   1 MASIEWRKDGLDIQL-PGD 18
SBJCT:  30 MPTIHWQKNQQDLTPNPGD 48




The amino acid sequence of FCTR1 also had 32/83 residues (38%) identical, and 45/83
residues (54%) positive to amino acids 55-137, and 24/68 residues (35%) identical, and 37/68
residues (54%) positive to amino acids 166-225 of Homo sapiens PTPsigma-(Brain) Precursor
shown in Table 1I.

Table 1I. BLASTP of FCTR1 against Homo sapiens PTPsigma-(Brain) Precursor (SEQ ID NO:44)

PTNR:TREMBLNEW-ACC:AAD09360 PTPSIGMA-(BRAIN) PRECURSOR - HOMO SAPIENS (HUMAN), 1502 AA.

LENGTH = 1502

SCORE = 109 (38.4 BITS), EXPECT = 0.00010, P = 0.00010
IDENTITIES = 32/83 (38%), POSITIVES = 45/83 (54%)

QUERY:  14 QLPGDD-PHISVQFRG---GPQRFEVTGW-------LQIQAVR-PSDEGTYRCLARNALG 61
SBJCT:  55 QATGDPKPRVTWNKKGKKVNSQRFETIEFDESAGAVLRIQPLRTPRDENVYECVAQNSVG 114

QUERY:  62 QVEAPASLTVLTPDQLNSTGIPQL 85
SBJCT: 115 EITVHAKLTVLREDQLPS-GFPNI 137

SCORE = 77 (27.1 BITS), EXPECT = 0.25, P = 0.22
IDENTITIES = 24/68 (35%), POSITIVES = 37/68 (54%)

QUERY:   4 IEWRKDGLDIQLPGDDPHISVQFRGGPQRFEVTGWLQIQAVRPSDEGTYRCLARNALG-Q 62
SBJCT: 166 ITWFKDFLPV-----DPSAS---NGRIKQLR-SGALQIESSEETDQGKYECVATNSAGVR 216

QUERY:  63 VEAPASLTV 71
SBJCT: 217 YSSPANLYV 225

The amino acid sequence of FCTR1 also had 32/83 residues (38%) identical, and 45/83
residues (54%) positive for amino acids 55-137 and 26/69 residues (37%) identical, and 38/69
residues (55%) positive for amino acids 166-234 of Homo sapiens Protein-Tyrosine Phosphatase
Sigma shown in Table 1J.

Table 1J. BLASTP of FCTR1 against Homo sapiens PTPsigma-(Brain) Precursor (SEQ ID NO:45)

PTNR:SPTREMBL-ACC:Q13332 PROTEIN-TYROSINE PHOSPHATASE, RECEPTOR-TYPE, S PRECURSOR (EC
3.1.3.48) (PROTEIN-TYROSINE PHOSPHATASE SIGMA) (R-PTP-SIGMA) (PTPRS) - HOMO SAPIENS
(HUMAN), 1948 AA.

LENGTH = 1948

SCORE = 109 (38.4 BITS), EXPECT = 0.00013, P = 0.00013
IDENTITIES = 32/83 (38%), POSITIVES = 45/83 (54%)

QUERY:  14 QLPGDD-PHISVQFRG---GPQRFEVTGW-------LQIQAVR-PSDEGTYRCLARNALG 61
SBJCT:  55 QATGDPKPRVTWNKKGKKVNSQRFETIEFDESAGAVLRIQPLRTPRDENVYECVAQNSVG 114

QUERY:  62 QVEAPASLTVLTPDQLNSTGIPQL 85
SBJCT: 115 EITVHAKLTVLREDQLPS-GFPNI 137

SCORE = 88 (31.0 BITS), EXPECT = 0.023, P = 0.022
IDENTITIES = 26/69 (37%), POSITIVES = 38/69 (55%)

QUERY:   4 IEWRKDGLDIQLPGDDPHISVQFRGGPQRFEVT---GWLQIQAVRPSDEGTYRCLARNAL 60
SBJCT: 166 ITWFKDFLPVDPSASNGRIK-QLRS--ETFESTPIRGALQIESSEETDQGKYECVATNSA 222

QUERY:  61 G-QVEAPASLTV 71
SBJCT: 223 GVRYSSPANLYV 234



A ClustalW analysis comparing the protein of the invention with related protein
sequences is given in Table 1K, with FCTR1 shown on line 5. In the ClustalW alignment of the
FCTR1 protein, as well as all other ClustalW analyses herein, the black outlined amino acid
residues indicate regions of conserved sequence (i.e., regions that may be required to preserve
structural or functional properties), whereas non-highlighted amino acid residues are less
conserved and can potentially be mutated to a much broader extent without altering protein
structure or function.

Table 1K. ClustalW Analysis of FCTR1

1) Q07822 MAC25 PROTEIN (SEQ ID NO:40)

2) Q16270 PROSTACYCLIN-STIMULATING FACTOR (SEQ ID NO:42)

3) Q61581 FOLLISTATIN-LIKE 2 (FOLLISTATIN-LIKE PROTEIN) (SEQ ID NO:39)

4) BAA21725 IGFBP-LIKE PROTEIN (SEQ ID NO:38)

5) FCTR1 (SEQ ID NO:2)

6) B40098 COLORECTAL CANCER SUPPRESSOR DCC - RAT (FRAGMENTS) (SEQ ID NO:43)



[ClustalW multiple sequence alignment of Q07822, Q16270, Q61581, BAA21725, FCTR1, and B40098; the alignment graphic is not recoverable from the source text]



IGFBP is expressed in neural stem cells and in the developing central nervous system. MAC-25, a
follistatin-like protein, is a growth suppressor of osteosarcoma cells and meningiomas. DCC is
expressed in most normal tissues, especially in colonic mucosa, but is deleted in colorectal
cancers.

Since FCTR1 has similarity to these proteins (shown in BlastP, Tables 1C-1J, and in
ClustalW, Table 1K), it is likely that it has a similar function. Therefore, FCTR1 could function as
one or more of the following: a tumor suppressor gene or a regulator of neurological system
development.

Based on the protein similarity and tissue expression, FCTR1 may be useful in the
following diseases and uses:

(i) Tissue regeneration in vitro and in vivo

(ii) Neurological disorders, neurodegenerative disorders, nerve trauma

(iii) Reproductive health

(iv) Immunological disorders, allergy and infection

(v) In cancer as a diagnostic and prognostic marker, as well as a protein therapeutic

FCTR2 

FCTR2 (alternatively referred to herein as AC012614_1.0.123) is a growth factor
bearing sequence similarity to human KIAA1061 protein and to genes involved in neuronal
development and reproductive physiology (e.g., cell adhesion molecules, follistatin, roundabout
and frazzled). FCTR2 is a full-length clone of 5502 nucleotides, including the entire coding
sequence of an 815 amino acid protein. This sequence is expressed in glioma, osteoblast, other
cancer cells, lung carcinoma, and small intestine. (This sequence maps to UniGene Hs.123420, which
is expressed in brain, breast, kidney, pancreas, and pooled tissue.)

A FCTR2 ORF begins with an ATG initiation codon at nucleotides 420-422 and ends
with a TGA codon at nucleotides 2865-2867. Putative untranslated regions upstream from the
initiation codon and downstream from the termination codon are underlined in Table 2A, and the
start and stop codons are in bold letters.
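
By way of illustration only, the stated ORF coordinates can be checked against the stated protein length with simple arithmetic. The function name and argument names below are hypothetical; the only values taken from the text are the 1-based coordinates 420 and 2867.

```python
def orf_summary(start, stop_end):
    """Return (ORF length in nucleotides, encoded protein length in residues).

    start    -- 1-based position of the first base of the initiation codon
    stop_end -- 1-based position of the last base of the termination codon
    """
    length_nt = stop_end - start + 1
    if length_nt % 3 != 0:
        raise ValueError("ORF length is not a multiple of three")
    codons = length_nt // 3
    # The termination codon does not encode a residue.
    return length_nt, codons - 1


print(orf_summary(420, 2867))  # (2448, 815)
```

The result of 815 residues agrees with the 815 amino acid protein described for FCTR2.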

Table 2A. FCTR2 Nucleotide Sequence (SEQ ID NO:3). 

CAATTTCACACAGGAAACAGCTATGCCATGATT^

TGCTGGAATTCGGCTTACTCACTATAGGGCTCGAGCGGCTGCCCGGGCAGGTCATTAATTCCATTTCTTTTTAGAGTATC 
ACAGCTTTCTCCTTCACTGACCACCCTTTGCTTCCTGTCAGAAAGCCCTGGACAGAACTCTCTGTGGGATTCTGCCCATG 
TTTCTGAGATATCGCCTCAATTGTCCTGGCTGGGCTGTCGGGTCTGCCCGTTTTACAGATGGGCAAACTGGAGTGGGAAG

TATCCGGGTGGCTTCCTCAGGCCTGCAGCTGGTGGAGCAGCTACTGAAACAATCAGGAGCCCAGAAGCTTTGAAGTCAGA 

AGAAGAGAAGACTCCCAGAATGCAGTGTGATGTTGGTGATGGACGCCTGTTTCGCCTTTCACTTAAACGTGCCCTTTCGA

GCTGCCCTGACCTCTTTGGGCTTTCCAGCCGCAACGAGCTGCTGGCCTCCTGCGGGAAGAAGTTCTGCAGCCGAGGGAGC 

CGGTGCGTGCTCAGCAGGAAGACAGGGGAGCCCGAATGCCAGTGCCTGGAGGCATGCAGGCCCAGCTACGTGCCTGTGTG 

CGGCTCTGATGGGAGGTTTTATGAAAACCACTGTAAGCTCCACCGTGCTGCTTGCCTCCTGGGAAAGAGGATCACCGTCA 

TCCACAGCAAGGACTGTTTCCTCAAAGGTGACACGTGCACCATGGCCGGCTACGCCCGCTTGAAGAATGTCCTTGTGGCA 

CTCCAGACCCGTCTGCAGCCACTCCAAGAAGGAGACAGCAGACAAGACCCTGCCTCCCAGAAGCGCCTCCTGGTGGAATC

TCTGTTCAGGGACTTAGATGCAGATGGCAATGGCCACCTCAGCAGCTCCGAACTGGCTCAGCATGTGCTGAAGAAGCAGG 

ACCTGGATGAAGACTTACTTGGTTGCTCACCAGGTGACCTCCTCCGATTTGACGATTACAACAGTGACAGCTCCCTGACC 

CTCCGCGAGTTCTACATGGCCTTCCAAGTGGTTCAGCTCAGCCTCGCCCCCGAGGACAGGGTCAGTGTGACCACAGTGAC 

CGTGGGGCTGAGCACAGTGCTGACCTGCGCCGTCCATGGAGACCTGAGGCCACCAATCATCTGGAAGCGCAACGGGCTCA 

CCCTGAACTTCCTGGACTTGGAAGACATCAATGACTTTGGAGAGGATGATTCCCTGTACATCACCAAGGTGACCACCATC

CACATGGGCAATTACACCTGCCATGCTTCCGGCCACGAGCAGCTGTTCCAGACCCACGTCCTGCAGGTGAATGTGCCGCC 

AGTCATCCGTGTCTATCGAGAGAGCCAGGCACAGGAGCCTGGAGTGGCAGCCAGCCTAAGATGCCATGCTGAGGGCATTC 

CCATGCCCAGAATCACTTGGCTGAAAAACGGCGTGGATGTCTCAACTCAGATGTCCAAACAGCTCTCCCTTTTAGCCAAT 

GGGAGCGAACTCCACATCAGCAGTGTTCGGTATGAAGACACAGGGGCATACACCTGCATTGCCAAAAATGAAGTGGGTGT

GGATGAAGATATCTCCTCGCTCTTCATTGAAGACTCAGCTAGAAAGACCCTTGCAAACATCCTGTGGCGAGAGGAAGGCC 

TCAGCGTGGGAAACATGTTCTATGTCTTCTCCGACGACGGTATCATCGTCATCCATCCTGTGGACTGTGAGATCCAGAGG

CACCTCAAACCCACGGAAAAGATTTTCATGAGCTATGAAGAAATCTGTCCT^^ 

CCAGTGGGTATCTGCAGTCAATGTCCGGAACCGGTACATCTATGTGGCCCAGCCAGCAC^^ 

ACATCCAAGCCCAGAAAGTCCTACAGTCCATAGGTGTGGACCCTCTGCC 

CAAGTGTGGGTCCTGAGCTGGGGGGACGTGCACAAGTCCCGACCAAGTCTCC^ 

GAGCC^G»CCTCATCCGCACA£CCTTTGCAGGAGTGGATGATTTC 

TCAGGTTTGGCTTCATCTTCAACAAGTCTGATCCTGCAGTCCAGAAGGTC 

ATCGGCCTGCACCACCATGGCTGCGTGCCCCAGGCCATGGCACACACCCACCTGGGCGGCTACT 

ACAGGACAGCCCCGCCTCTGCTGCCCGACAGCTGCTCGTTGACAGTGTCACAGACTCTGTGCTTGGCCCCAATGGTGATG 
TAACAGGCACCCCACACACATCCCCCGACGGGCGCTTCATAGTCAGTGCTGC^ 
GAGATCACAGTGCGGGGCGAGATCCAGACCCTGTATGACCTGCAAATAAACTCGGG^ 
CTCCTTCACTGAAAGCAATCAATACAACATCTACGCGGCTCTC 

CGGGGAAGGTGGGCATGCTGAAGAACTTAAAGGAGCCACCCGCAGGGCCAGCTCAGCCCTGGGGG^ 
ATGAGGGACAGTGGGCTGTTTGGACAGTACCTCCTCACACCAGCCCGAGAGTC^ 

CACGCTGCGGTGTGAGGTGTCAGGTATAAAGGGGGGGACCACAGTGGTGTGGGTGGGTGAGGTATG AAGGGCCCAGAGC^ 
GAGCCCTGGGCCAAGGAACACCCCCTAGTCCTGACACTGCAGCCTCAAGCAGG 

CAAAAACCTGTACTCGCTTTGTGGTTCAACACTGGTCTCCTTGCAAGTTTCCTAGTATAAGGTATGCGCTGCTACCAAGA 

TTGGGGTTTTTTCGTTAGGAAGTATGATTTATGCCTTGAGCTACGATGAGAACATATGCTGCTGTGTAAAGGGATCATTT

CTGTGCCAAGCTGCACACCGAGTGACCTGGGGACATCATGGAACCAAGGGATCCTGCTCTCCAAGGA 

GTTGCCTTCACATAGTCATTGTCCCTTACTGCCAGACCCAGCCAGACTTTGCCCTGACGGAGTGGCCCGGAAGCAGAGGC

CGACCAGGAGCAGGGGCCTCCCTCCCGAACTGAAAGCCCATCCGTCCTCGCGTGGGACCGCATCTTCTCCCTCGCAGCTG

CTTCTTGCTTTTCTTTCCATTTGACTTGCTGTAAGCCTGAGGGAGAGCCAACAAGACTTACTGCATCTTGGGGGATGGGG 

AAATCACTCACTTTATTTTGGAAATTTTTGATTAAAAAAAAATTTTATAATC^ 

TCCGAGGTCCAACTATATCCTTCCCTGCCTTAGGCCGAGTCTCGGGGGTGGTCACAACCCCACATCCCACAGCCAGAAAG

AACAATGGTCATCTGAGAATACTGGCCCTGTCGACTATTGCCACCCTGCTTCTCC^ 

GTAAGGACTCGGTTCTGTGTTGGGACCCCAAAAAACGAGAACAAGTTCTGTGTGCCT^ 

TCTCATTAGTCAGGTCTGGTACCCCAGATTCAGGGCAGACTGGGCTTGCCTGGC^GGTATGGGTGGCCTCC^GGCTC^ 
TGCAGAAACCCCAAGGACACGAGTGGGGCCAGGTGAGTTCCTGAAGCTATACCTTTTCAAAACAGATTTTGTTTTCCTAC
CTGTGGCCCATCCACTCCTCTCTGGTACCCCATCCCCGCATCAGCACTGCAGAGAGAACACATTTCGGCGAGGGTTTTCT 
TACCCACATTCCCCAATCAATACACACACACTGCAGAACCCAGAAC^ 

TATGTGTCTCAGGCTGTGGTGACTCTCACATGGGCATCGAAGAAGTACAACCCACATAGCCCTCTGGAGACCGCCTAGAT

CAGAGACTCAGCAAAAACAGGCTCGCCTTCCCTCTCCCACATATGAGTGGAA 

TTTTGCAAGCGACACGGGTTGGGAGAGGTGGTCTCACCACAGACGTCTTTG 

GACCAGGATTTTCCTTTGCCATTAAGGAATGAACTCTTTCAAGGAGAGGAAACCCTAGACTCTGTGTCACTCTCAACACA 

CACAGCTCCTTTCACTCCTGCCTGACTGCCAAGCCACCTGCATCCCCCGCCCCA 

GTCTGACGCAACTTGGTCCACCAAACGCCTGTCCCCTGTAACTCCTAGGGG 

TTTTAAAAGATATGCTATGTAGATATAAGTTGAGGAAGCTCACCTCAAAAGCCTAGAAT^ 

TGCATGGATGACCCATCTCACCCCTTTTTTTTTCCTGCCTCAATATCTTGATA 

TTTACCACTAAAATTCTCCAACTTTCATAAACTTTTTTTTGGAAAAATTTCC^^ 

TCTCTGAGCCTAAAGGAGGAAAAGTCCCACCAACTACCAGACCAGAACACGAGCCCCTCTG^ 

CAAAGACCAGTTTGACCCAAACTGGCCTTTTAAAATAATC^GGAGTGACAGAGTCmCTTCTGC^GCACCTGCTTCTCC^ 
CCACTGTCCCTTCCATCTTGGAATGTGTCTAAAAAAGCATAGCTGCCCTTTGCTGTCCTCAGAGTGCATTTCCTGGAGAC
GGCAGGCTTAGGTCTCACTGACAGCATGCCAGAC^CAACTGAATCGAAGCAGGCCTGAAGCCTAGGTC^ 
GTCCAGCCCCAGGAGGCAAAGTCACCAATGCAGGGAGGTAAATGCCTTTTGGCAGGAAAA^ 

GGGAGTCAGGGGTGGGAGGAGAAGGAGGAAGAGGAGGAAGGCCAGACTGGCCTGCCCTTTCTCCCATACTTCACCCCAGC 

AGAGGTTCATGGGAC^CAGTTGGAAAGCCACTGGGAGGAAATGCCTCACTACAGGGGGGCCTCCTGT^ 

GGTAATCCTCCTAATGAACCCACAAGGTCAATTCACAACTGATATCTTAGCTATTAAAGAAGTACTGACTTTACCAAAAG

AATCATCAAGAAAGCTATTTATATAAACCCCCTCAGTCATTTTGAAATAAAATTAATTTTAC 
The predicted amino acid sequence of FCTR2 protein corresponding to the foregoing
nucleotide sequence is reported in Table 2B. FCTR2 was searched against other databases using
SignalPep and PSort search protocols. The protein is most likely located in the mitochondrial
matrix space (certainty=0.4718) and seems to have no N-terminal signal sequence. The predicted
molecular weight is 90346.9 Daltons.



Table 2B. FCTR2 Protein Sequence (SEQ ID NO:4). 

MQCDVGDGRLFRLSLKRALSSCPDLFGLSSRNELLASCGKKFCSRGSRCVLSRKTGEPECQCLEACRPSYVPVCGSDGRFYENHCK
LHRAACLLGKRITVIHSKDCFLKGDTCTM
AQHVLKKQDLDEDLLGCSPGDLLRFDDYNSDSSLTLREFYMAFQVVQLSLAPEDRVSVTTVTVGLSTVLTCAVHGDLRPPIIWKRN
GLTLNFLDLEDINDFGEDDSLYITKVTTIHM

ITWLKNGVDVSTQMSKQLSLLANGSELHISSVRYEDTGAYTCIAKNEVGVDEDISSLFIEDSARKTLANILWREEGLSVGNMFYVF
SDDGIIVIHPVDCEIQRHLKPTEKIFMSYEEICPQREKNATQPCQWVSAVNVRNRYIYVAQPALSRV

PAKLSYDKSHDQVWVLSWGDVHKSRPSLQVITEASTG
MPLKTIGLHHHGCVPQAMAHTHLGGYFFIQCRQDSPASAARQLLVDSVTDSVLGPNGDVTGTPHTSPDGRFIVSAAADSPWLHVQE
ITVRGEIQTLYDLQINSGISDLAFQRSFTESNQYNIYAALHTEPDLLFLELSTGKVGMLKNLKEPPAGPAQPWGGTHRIMRDSGLF
GQYLLTPARESLFLINGRQNTLRCEVSGIKGGTTVVWVGEV



In a BLASTN search it was also found that nucleotides 784-5502 of FCTR2 nucleic acid
had 4672 of 4719 bases (99%) identical to Homo sapiens mRNA for KIAA1061 protein, partial
cds (GenBank Acc:AB028984) (Table 2C).
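
By way of illustration only, the aligned span and percent identity stated above can be recomputed from the reported coordinates and match count. The helper names are hypothetical; the values 784, 5502 and 4672 are taken from the text.

```python
def aligned_span(first, last):
    """Number of positions covered by an inclusive, 1-based coordinate range."""
    return last - first + 1


def percent_identity(matches, aligned_length):
    """Identical positions as a percentage of the aligned length."""
    if aligned_length <= 0:
        raise ValueError("aligned length must be positive")
    return 100.0 * matches / aligned_length


span = aligned_span(784, 5502)        # nucleotides 784-5502 of FCTR2
pid = percent_identity(4672, span)    # 4672 identical bases
print(span, round(pid, 1))
```

The span evaluates to 4719 positions, the same figure used as the denominator in the reported identity of 4672/4719.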



Table 2C. BLASTN of FCTR2 against Homo sapiens mRNA for KIAA1061 protein (SEQ ID NO:46)

>GI|5689458|DBJ|AB028984.1|AB028984 HOMO SAPIENS MRNA FOR KIAA1061 PROTEIN, PARTIAL CDS

LENGTH = 4719

SCORE = 9075 BITS (4578), EXPECT = 0.0
IDENTITIES = 4672/4719 (99%)
STRAND = PLUS / PLUS






QUERY : 


784 


SBJCT : 


1 


QUERY : 


844 


SBJCT : 


61 


QUERY : 


904 


SBJCT : 


121 


QUERY : 


964 


SBJCT: 


181 


QUERY: 


1024 


SBJCT : 


241 


QUERY : 


1084 


SBJCT : 


301 



MMIIIMI IIMIillMIIII IIMIMMIIIII lllll!l!IIIIIMMI Ml 

AGAATGTCCTTCTGGCACTCCAGACCCGTCTGCAGCCACTCCAAGAAGGAGACAGCAGAC 6 0 



MIIIIIMIIMMIIIMIIIMIIIIMIIIIMIIIIIIIMMIIMIIMMII 

AAGACCCTGCCTCCCAGAAGCGCCTCCTGGTGGAATCTCTGTTCAGGGACTTAGATGCAG 120 



IMIIIIMMIIIIINIIIIIIIMMIIMIII1MIMMMII11IIIII1I1 

3GCAATGGCCACCTCAGCAGCTCCGAACTGGCTCAGCATGTGCTGAAGAAGCAGGACC 

3ATGAAGACTTACTTGGTTGCTCACCAGGTGACCTCCTCCGATTTGACGATTACAACA 
45 | | | | | | | | | | | | | | | | | | | | | | | | | | | | | || | | M | | | | | | | | | | M II I I 1 I i II I I I I 

3ATGAAGACTTACTTGGTTGCTCACCAGGTGACCTCCTCCGATTTGACGATTACAACA 

3ACAGCTCCCTGACCCTCCGCGAGTTCTACATGGCCTTCCAAGTGGTTCAGCTCAGCC 

IMIIMMMIMIIIMMIIIIMIIIIIIIMMIMIIIIIMIIIIIIIMI 

SACAGCTCCCTGACCCTCCGCGAGTTCTACATGGCCTTCCAAGTGGTTCAGCTCAGCC 
jCCCCCGAGGACAGGGTCAGTGTGACCACAGTGACCGTGGGGCTGAGCACAGTGCTGA 

MMIMIIIIIIIlllllllllMIIIMIIIIIIIIIIllllllllllllllllll 



QUERY :


1144 


LL IbbbCbb 1 bCAibbAbALL IbAbbbLALLAAl Liil L I LL AAL L. L L-AALLLL L. 1 LALLL, 




SBJCT : 


361 


IIIIIMIMIIIIMIIIIIIIIIMIIIIIIIIIMIIIIIIMIIIIIIMIIIIII 

CCTGCGCCGTCCATGGAGACCTGAGGCCACCAATCATCTGGAAGCGCAACGGGCTCACCC 






1 o A /l 


1 LAAL 1 X LL 1 LLAL- 1 X LLAriLALA 1 L.ri/-1 1 LriL ±11 OLrtLrlLLra 1 Vjri-i.1 1 lui Jr\<^\ 1 V-i-i. 

lllllllllllllillllllllllllllllllllllllllllllMIIIIIIMIMlll 

i bAAL. 1 I LL I bbAb 1 1 bbAAbALA 1 LAB. 1 bAL 1 1 ILLALALLAiLAl 1 LLL 1 L 1 ALA1 


1263 


SBJC1 : 


421 


Aft n 


QUERY : 


1264 


/■-»/— 171 -j\ «-i/-im/-« tv /~«/^t\ ti m OO A /-*7i irt/nn/^/TA 7\ mrri 7\ /~i 7\ « I i^"*t OO A T/* 1 OT'T'OOOO OO A OO A O O 7V O O 

CCAAGGTGACCACCAi CCACAlbbbLAAi lACALLlbbCAlbbi 1 bLbbbLAbbAbLAbb 


1 lOT 


SBJCT : 


481 


MMIMIIIIIIIIIIMIIIMIMIMIIIIIIIIIIIIIIIIIIIIIIMIIIIII 

CCAAGGTGACCACCAI CCACAlbbbLAAi lACACLlbbCAlbCi I LCbbCLACbAbbAbb 


D4t U 


QUERY : 


1324 


rporpT'oo a o a oooa /^/^mriz-irri^ ^/-im/^i 71 7\ rp/~irp/~i ooooo aot*o A. r Pf r "*/"*r* r PO r r , r' 1r P a, ^vr^r^'AC a.oa 
lb I I LCAbACCCALb 1 Cb i bCAbb I (jMIu 1 bbbbbbAb 1 Uft I LLb 1 b 1 b 1A1 LLHuAuH 




SB JCT : 


541 


IIIIMIMIMIMIMMIMIMIIIMIIIIIIIMIMIIIIIIIIIIIIIIIII 

rpO' pm/"i/-i tv r~i A r^OO A OOT'OOT'OO A OOT>0 A 7\ rrr/^rri/^»/*^ /*^<~*7\ /^ 1r Pr~*7\ l T 1 /~ , r~'r*' r Pi^ ,r nr 1r r , 7\ TPPAPTinA 

TGTTCCAGACCCACGTCCTGCAGGTGAA i G 1 GCCGCCAvj 1 LAi CC(j iblLiAI CCALrAtjA 


£T n n 
bUU 


QUERY : 


1384 


GCCAGGCACAGGAGCC 1 GGAG 1 GGUAGCCACjCC X AAGA1 GCUAlvjC 1 (jAvjLtoCA 1 1 CCCA 


1 A A 7 


SB J CT : 


601 


GCCAtjtjrCACAGtjACjCL i CjoALt 1 (jVjCAUCCAvjCC 1 AAvjAI oCCAl (aL, I (jA^Lrvjv-A 1 1 ULUi 




QUERY : 


1444 


1 CjCCCACjAAI CAC 1 1 Cjv^C 1 (jAAAAACtjljCU 1 LjvjAI 1 C 1 1 LabAlu 1 LUiwiLiibL 


1 CAT 


SB JCT : 


661 


MIIIIIIIIIMMIMIIIIIMIIIIIMIMIIIIIIMIIIIIIIIIMIIIIII 

TGC C CAGAATCACTTGG CTGAAAAACGG CGTGGATGT CT CAACT CAGATGTCCAAACAGC 


n *i n 


QUERY : 


1504 


TCTCCCTTTTAGCCAATGGGAGCGAACTCCACATCAGCAGTG1 TCGG IAIGAALjACACAIji 




C T> TOT" _ 

SBJCT : 


721 


1 M 1 M MM M i: II M MM! Ml! MINN M III Mil MM il !!ll 1! II Ml 

IC1CCC1 I 1 1 AbLLAAl bbbAbtbAAL 1 LLAUA 1 UAbuAb lb 1 1 tbb iAlbAHbALaUib 


7ftH 


QUERY : 


1564 


GGGLAIACACC IvjCAi ibttiwWiibAAblbbbIbibbAibAAbAiAiblLLlLbbJ.Ll 


luZ j 


SBJCT : 


781 


MMIMMMM M MMMIMMIIIIIIIiilMMIMMMMMMM!!! 

PPPPATAPA /~*/~*T/~~* r*T\ rprpp ^n/-17\ A A A A T*/"* A A prppppTiprppp 7\ rpp A APA'PA T , ^^ 1^ T , ^ ,, P'T , ^^*/^*^^ r T , /^ ,, T , 

GGGCATACACCTGCA x i GCCAAAAA l GAAG i GGvj 1 G 1 GGA J. GAAGA i A i C 1 CC i CbC ibl 


q a n 


QUERY : 


1624 


TCATTGAAGACTCAGCTAGAAAGACCCTTGCAAACATCCTGiGGCGAGAGGAAGCjCC 1 CA 


iboj 


SBJCT : 


841 


MIMM Ml! MIMMMMMMMMiMIMMMMMMilll IIMIMI II 

TCATTGAAGACTCAGCTAGAAAGACCCTTGCAAACATCCTGTGGCGAGAGGAAGGCC I CA 


q n n 


QUERY: 


1684 


n nnrnrtnn "A 7\ ?\ /"ttv m^~i mm tv mpmpttifrprnnPP tv w a pipprpTi cn/~t A rnppmPTi mnrtTi rpO/^rri/^irp»~</~i 

gcgtgggaaacatgttctatgtcttctccgacgacggtatcatcgtcatccat CC IG iGG 


1/43 


SBJCT : 


901 


II MMMMMMIMMIMM MMIMIIMMMIIIIIIMIMIIIIMM! 

GCGTGGGAAACATGTTCTATGTCTTCTCCGACGACGGTATCATCGTCATCCATCC I b 1 bbr 


Q £T A 

y D U 


QUERY : 


1744 


A /~ i TT^ , T 1 !^ 1 * A/^AT'OOAOA /~*P^ O A OOT*/^ A A A PPPA f^r+C 1 AAA APATT 1 1 M 1 'PATP APPTATP A A ^ A A A 

AC lbl bACrA 1 C CAbAbLrCACC 1 UAAACCCAUbUrAAAAvjxA ilil LAi bAbL i A 1 vjrAAL7AAA 


lOUJ 


SBJCT : 


961 


;M MMMIMMMMMMIM MIIMliM! IMIMIIIMMMIIIIIMM 

ACTGTGAGATCCAGAGGCACCTCAAACCCACGGAAAAGATTTTCATGAGCTATGAAGAAA 




QUiiKY : 


1804 


r P/~'TV« l T»/T* r T , r ,1 A A A O A r^XTNTT\TNTNTNTTvT r nr' r 1 A A 0/^0 AO PpprriPPp7\PrTipr(r<tTiA TPTPPAPTPB A TT 1 

lCiCjiCCI CAAAbAbWiNiNiNiNiNiN i bLAALLLAbLLL i bLLAb 1 bLrb 1 Al L, 1 bLAb 1 LAAib 

IIIIMMIIMM Ml IIIIMMMMMMMMII MMIIMIMM 

TCTGTCCTCAAAGAGAAAAAAATGCAACCCAGCCCTGCCAGTGGGTATCTGCAGTCAATG 


1 ft£7 
loo J 


SBJCT : 


1021 


1 A ft A 


QUERY : 


1864 


rn/*^OOOA A PPPP rn A O A rnprriTi rnrimppp/npA 0 0/*<A OZ**A pmPA PPAPAPT'PP'T'fpPT'PPTPPA OA 

TCCGGAACCGGTACATCTATGTGGCCCAGCCAGCACTGAGCAGAG1 CC 1 1 G I GG I CGACA 

IMi Ml II MMMIMMI M M MM II M M 1 1 II III! MM MMMMIMI M 

TCCGGAACCGGTACATCTATGTGGCCCAGCCAGCACTGAGCAGAGTCCTTG1 GG I CGACA 




fin ~r/~tm - 

SBJCT : 


1081 


i i a r\ 
114 U 


QUERi : 




rr»00 A AOOOOAO A A A O TOOT 1 A OAOT'OOAT'AOO T'O T'O O A (~^r~ , f~ , T i r~ ,r Pr^ f^r^f^r* <~ ir V7\ APPTPTPPT 

1 LLAAbLLLTibAAAblLLlALAblLLAlAbblblbbALLL 1 L- lbL,L.LrtjL, lAAbL lbl L-b I 




SBJCT : 


1141 


MMMIM MiMilMIMMMMMM MMMMIMI MMMMMIIMM 

rpoOA APPPPAPA A A PfPPPrpA OAOTIOOA rrlA OOT'OT'OO A OOOT'OT^O r^r^f^f* OHP A APPTprpppqi 

TCCAAGCCCAGAAAGTCCTACAGTCCA1 AGG 1 G 1GGACCC 1 C I GCCGGC i AAGC i G 1 CC 1 


1200 


ATTDDV . 


1984 


A T'O A O A A OT'O A OA T'O A OO A A PTPTPPPTPPTP A P PTPPPPPP A PPTPP A OA AOT^OOOO A O 

A 1 LrACAAb l CACA 1 bAL LAAb lbl bbb 1 CC 1 bAbL 1 bbbbbbACb 1 bCACAAb 1 LLLbAL 


O A A "2 
U4 J 


C T~> TOT* . 

SBJCT : 


1201 


MIMM IMMIIMIIIIMMIIIMMII MMMMIMMMIMIMIIMM 

A T'O A OA A OT'O A OA T'O A OOA A prpnrnAppmnpmA7\ Apmppp Apft 71 /"*t/~lrri/~l 71 7s /-*rp<^*r-ir-t/~* 71 f~i 

Al GACAAG rCACATGACCAAGTGTGGGTCC 1 GAbCTGbGGGGACb i GCACAAb 1 CCCbAC 


1 o/*r\ 

izb U 


nTTT?DV . 

QUERY : 


2 044 


OA AOT , O r POOAOOT l O A T'O A OAOA AOOOAOOA r^r^r^f* f^r^Ty /^Ti r*/^/" 1 ^ OOA pprpPA m/-t,-i»^i r~i7\ O 

CAAG I C 1 CCAGG 1 GA I CACAGAAbCCAbCACCbbCCAGAGCCAGCACC 1 CA1 CCbCACAC 


^ 1U 5 


on tot . 


l^O 1 


III MMMM MIMMIMI IMMM MMMMMMMMMMMMIMM II 

OA APTPTPPAPPTP ATPAPAPA AOOOAOOA r^/^f^r^ OOAO AO OOAO OA PPTPA T'OOOO A O A O 

CAAG I C i CCAbb 1 GA 1 CACAbAAbCCAbCACCbbCCAGAGCCAGCACC 1 CA 1 CCbCACAC 


1 iz U 


PlTTPDV . 

QUEKi : 


2104 


ppminii'ippTipp 71 OT'OO A T'O A 1 1 " 1 " i 'O't " 1 'PA'I'TPPPPPA A OA A A PPTPA T'O A T'O A APPAPATPA 

CC1 1 IbCAbbAbibbAlbAi 1 ICi 1CA1 1 LLLLLAALAAALL 1 UAi UA1 LAALLALA i LA 


Z lO J 


SBJCT : 


1321 


IIIMMIIMMIIIMIIIIIIIMIMMIIIMMIIMIIIMIIMMMMII 

CCTTTGCAGGAGTGGATGATTTCTTC^TTCCCCCAACAAACCTCATCATCAACC^ 


1380 


QUERY : 


2164 


GGTTTGGCTTC^TCTTCAACAAGTCTGATCCTGCAGTCCACAAGGTGGACCTGGAAACAA 


2223 
SBJCT: 1381 GGTTTGGCTTCATCTTCAACAAGTCTGATCCTGCAGTCCACAAGGTGGACCTGGAAACAA 1440
QUERY: 2224 TGATGCCCCTCAAGACCATCGGCCTGCACCACCATGGCTGCGTGCCCCAGGCCATGGCAC 2283

lllMMIMIIIMIIMIIIIllMIMMMllllillMIMMIIIIIIMIlM 

SBJCT: 1441 TGATGCCCCTCAAGACCATCGGCCTGCACCACCATGGCTGCGTGCCCCAGGCCATGGCAC 1500 
QUERY: 2284 ACACCCACCTGGGCGGCTACTTCTTCATCCAGTGCCGACAGGACAGCCCCGCCTCTGCTG 2343 

IMIIIIiMMIIMIMMMMIIillllMMMMlMMMMMMMIMK 

SBJCT: 1501 ACACCCACCTGGGCGGCTACTTCTTCATCCAGTGCCGACAGGACAGCCCCGCCTCTGCTG 1560 
QUERY: 2344 CCCGACAGCTGCTCGTTGACAGTGTCACAGACTCTGTGCTTGGCCCCAATGGTGATGTAA 2403 

IMIIMIimillllllMIIIMIIIIIIMIIIIIMilllllllMIIMIIIII 

SBJCT: 1561 CCCGACAGCTGCTCGTTGACAGTGTCACAGACTCTGTGCTTGGCCCCAATGGTGATGTAA 1620
QUERY: 2404 CAGGCACCCC&CAC&CATCCCCCGACGGGCGCTTCAT^^ 2463 

lillllMiMIIIMIIIIIIMIIMMIllllllllllllllllllllllllIMM 

SBJCT: 1621 (^GG(^CCCC^C^CA(^TCCCCCGACGGGCGCTTCATAGTCAGTGCTGCAGCTGACAGCC 168 0 
QUERY: 2464 CCTGGCTGCACGTGCAGGAGATCACAGTGCGGGGCGAGATCCAGACCCTGTATGACCTGC 2523 

IIMIIlllllMllllilllMIIMIIIIIMllllllllllllllMIIIIIIIM! 

SBJCT: 1681 CCTGGCTGCACGTGCAGGAGATCACAGTGCGGGGCGAGATCCAGACCCTGTATGACCTGC 1740 
QUERY: 2524 AAATAAACTCGGGCATCTCAGACTTGGCCTTCCAGCGCTCCTTCAC^ 2583 

l!lllllllMil!lllM!l!!IIIIM!MlMIMII!ll!ll!lllll!l!!!lll 

SBJCT: 1741 AAATAAACTCGGGCATCTC^GACTTGGCCTTCCAGCGCTCCTTCACTGAAAGCAATCAAT 1800 
QUERY: 2584 ACAACATCTACGCGGCTCTGCACACGGAGCCGGACCTGCTGTTCCTGGAGCTGTCCACGG 2643 

M M It I i 1 1 1 i I M f I M M I M I M i 1 1 M i I M 1 1 i M I M M I M 1 M 1 ! i 1 M 1 1 

SBJCT: 1801 ACAACATCTACGCGGCTCTGCACACGGAGCCGGACCTGCTGTTCCTGGAGCTGTCCACGG 1860 
QUERY: 2644 GGAAGGTGGGC^TGCTGAAGAACTTAAAGGAGCC^CCCGC^GGGCCAGCTCAGCCCTNNN 2703 

! 1 1 M 1 1 M i 1 1 1 f ! f 1 1 1 [ 1 1 1 1 1 1 1 1 1 1 ! 1 1 E I M M ! I E I ! M I i 1 1 M I M 1 1 

SBJCT: 1861 GGAAGGTGGGCATGCTGAAGAACTTAAAGGAGCCACCCGCAGGGCCAGCTCAGCCCTGGG 1920 
QUERY: 2704 NNI^TACCCACAGAATCATGAGGGACAGTGGGCTGTTTGGACAGTACCTCCTCACACCAG 2763 

IIMMMMllllllllMIIMIIIIIIIMIIllllllMIIMIilMIMI 

SBJCT: 1921 GGGGTACCCACAGAATCATGAGGGACAGTGGGCTGTTTGGACAGTACCTCCTCACACCAG 1980 
QUERY: 2 764 CCCGAGAGTCACTGTTCCTCATCAATGGGAGACAAAACACGCTGCGGTGTGAGGTGTCAG 2823 

MIMIMIIIIMlllMMMIMIIIMMIIIMMIMIilllllMMIIllll 

SBJCT: 1981 CCCGAGAGTCACTGTTCCTCATCAATGGGAGACAAAACACGCTGCGGTGTGAGGTGTCAG 204 0 

QUERY: 2 824 GTATAAAI^NNNNNACCACAGTO^ 2 883 

I I I i I I i I I I I I I I I I I I I t t I 1 I f I 1 t 1 I I f I f I 1 I i I I I I 1 i I f J t 1 I i f i 

SBJCT: 2 041 GTATAAAGGGGGGGACCACAGTGGTGTGGGTGGGTGAGGTATGAAGGGCCCAGAGCAGAG 2100 

QUERY: 2 884 C C CTGGGC CAAGGAACAC C C C C TAGT CCTGACACT G CAG C CT CAAG CAGGTACGC TGTAC 2 943 

I i 1 1 1 1 1 1 E 1 f 1 1 1 1 1 i 1 1 1 1 1 M 1 1 f 1 1 1 1 i 1 1 1 i i E 1 1 1 1 1 1 1 1 ! 1 1 1 M E 1 1 1 1 1 1 1 

SBJCT: 2101 CCCTGGGCCAAGGAACACCCCCTAGTCCTGACACTGCAGCCTCAAGCAGGTACGCTGTAC 2160 
QUERY: 2944 ATTTTTACAGACAAAAGCAAAAACCTGTACTCGCTTTGTGGTTCAACACTGGTCTCCTTG 3 003 

IIMIIIIMIIIIIIMIIIIMMMMMIIIIIMIMIIIMIIIIIMIIMM 

SBJCT: 2161 ATTTTTACAGACAAAAGCAAAAACCTGTACTCGCTTTGTGGTTCAACACTGGTCTCCTTG 2220 
QUERY: 3 004 CAAGTTTCCTAGTATAAGGTATGCGCTGCTACCAAGATTGGGGTTTTTTCGTTAGGAAGT 3063 

IIIIIIIIIIIIMMIMMIMlllllllllllllllMMIMMMilllllilll 

SBJCT: 2221 CAAGTTTCCTAGTATAAGGTATGCGCTGCTACCAAGATTGGGGTTTTTTCGTTAGGAAGT 2280 
QUERY : 3 064 ATGATTTATGCCTTGAGCTACGATGAGAACATATGCTGCTGTGTAAAGGGATCATTTCTG 3123 

IIMIIIIIMMIIMIIIIMIIMIIMIMIIMIIIMMMIIIIIIIIMIII 

SBJCT: 22 81 ATGATTTATGCCTTGAGCTACGATGAGAACATATGCTGCTGTGTAAAGGGATCATTTCTG 234 0 
QUERY: 3124 TGCCAAGCTGCACACCGAGTGACCTGGGGACATCATGGAACCAAGGGATCCTGCTCTCCA 3183 

I I i i m r 1 1 m 1 1 1 1 1 f 1 1 1 r 1 1 1 1 1 1 1 1 1 1 1 1 1 f 1 1 1 f ( 1 1 1 1 i 1 1 f i f i i 1 1 1 1 1 1 1 1 

SBJCT: 2341 TGC CAAG CTGCACACCGAGTGACCTGGGGACATCATGGAACCAAGGGATCCTGCTCT CCA 2400 
QUERY: 3184 AGCAGACACCTCTGTCAGTTGCCTTCACATAGTCATTGTCCCTTACTGCCAGACCCAGCC 3243 

IMIIMIIIilllllllllllllllMIMlMlllllllllllllMMIMIIIIII 

SBJCT: 2401 AGCAGACACCTCTGTCAGTTGCCTTCACATAGTCATTGTCCCTTACTGCCAGACCCAGCC 2460 



QUERY: 3244 AGACTTTGCCCTGACGGAGTGGCCCGGAAGCAGAGGCCGACCAGGAGCAGGGGCCTCCCT 3303

lIMiMIMMNIIIMilMMMIMIMIINMIIllllMMMlllllllll 

SBJCT: 2461 AGACTTTGCCCTGACGGAGTGGCCCGGAAGCAGAGGCCGACCAGGAGCAGGGGCCTCCCT 2520 
QUERY: 33 04 CCCGAACTGAAAGCCCATCCGTCCTCGCGTGGGACCGCATCTTCTCCCTCGCAGCTGCTT 33 63 

MlMMllMIIIIMIIIIIIIllllMIIIMMMMIIMllllllliMMMI 

SBJCT: 2521 CCCGAACTGAAAGCCCATCCGTCCTCGCGTGGGACCGCATCTTCTCCCTCGCAGCTGCTT 2580 
QUERY: 3364 CTTGCTTTTCTTTCCATTTGACTTGCTGTAAGCCTGAGGGAGAGCCAACAAGACTTACTG 3423 

! 1 1 [ 1 1 1 1 1 II 1 1 M 1 1 1 1 1 ! I i 1 1 II 1 1 M M M ! M : M I ! I M 1 1 M 1 1 1 1 1 M I ! I 

SBJCT: 2581 CTTGCTTTTCTTTCCATTTGACTTGCTGTAAGCCTGAGGGAGAGCCAACAAGACTTACTG 2640 
QUERY: 3424 CATCTTGGGGGATGGG^AAATCACTCACTTTATTTTGGAAATTTTTGATTNNNNNNNNNT 3483 

IIMIIIIIIIIIIMIIIIIMIIIIIIIIIMIIIIIIIMIIIIIII I 

SBJCT: 2641 CATCTTGGGGGATGGGGAAATCACTCACTTTATTTTGGAAATTTTTGATTAAAAAAAAAT 2700 
QUERY: 3484 TTTATAATCTCAAATGCTAGTAAGCAGAAAGATGCTCTCCGAGGTCCAACTATATCCTTC 3543 

lllMlMlMIMIMlMllliMIMMMIIIMllMllliMMMlMMMI 

SBJCT: 2701 TTTATAATCTCAAATGCTAGTAAGCAGAAAGATGCTCTCCGAGGTCCAACTATATCCTTC 2760 
QUERY: 3 544 CCTGCCTTAGGCCGAGTCTCGGGGGTGGTCACAACCCCACATCCC^CAGCCAGAAAGAAC 3603 

[ I M i 1 1 1 1 [ 1 1 M It M M 1 11 1 1 1 1 M M M 1 1 E I M M M t [ M 1 1 1 11 1 ! 11 1 1 M 

SBJCT: 2761 CCTGCCTTAGGCCGAGTCTCGGGGGTGGTCACAACCCCACATCCCACAGCCAGAAAGAAC 2820 
QUERY: 3604 AATGGTCATCTGAGAATACTGGCCCTGTCGACTATTGCCACCCTGCTTCTCCAAGAGCAG 3663 

IlilllMIIMIlllMMIMIIIIIMIMIIIIIIMIIIIIllllllMIMIll 

SBJCT: 2821 AATGGTCATCTGAGAATACTGGCCCTGTCGACTATTGCCACCCTGCTTCTCCAAGAGCAG 2880 
QUERY: 3664 ACCAGGCCACCTCATCCGTAAGGACTCGGTTCTGTGTTGGGACCCCAAAAAACCAGAACA 3723 

IMIIIMIMIMMIIIMMMIIIIMIIIIIMMIMIIMMIIIIIIMIM 

SBJCT: 2 881 ACCAGGCCACCTCATCCGTAAGGACTCGGTTCTGTGTTGGGACCCCAAAAAACCAGAACA 294 0 
QUERY: 3 724 AGTTCTGTGTGCCTCCTTTCAGCACAGAAGGGAGACATCTCATTAGTCAGGTCTGGTACC 3783 

MllllllMMMIMIIIIIIIMIMilllMIIIIMIIlllMMIllllllMI 

SBJCT: 2941 AGTTCTGTGTGCCTCCTTTCAGCACAGAAGGGAGACATCTCATTAGTCAGGTCTGGTACC 300 0 
QUERY: 3 784 CCAGATTCAGGGCAGACTGGGCTTGCCTGGCAAGGTATGGGTGGCCTCCAGGCTCAATGC 3843 

IIIIMllllllMllillllllllllllllMIIMIIIMIIIIIIIlllllMIMl 

SBJCT: 3001 CCAGATTC^GGGC^GACTGGGCTTGCCTGGCAAGGTATGGGTGGCCTCCAGGCTCAATGC 3 060 
QUERY: 3 844 AGAAACCCCAAGGACACGAGTGGGGCCAGGTGAGTTCCTGAAGCTATACCTTTTCAAAAC 3 903 

MlllllllllllllllllMlllllllMIMMIIIIIIIIlllllMliliKKN 

SBJCT: 3 061 AGAAACCCCAAGGACACGAGTGGGGCCAGGTGAGTTCCTGAAGCTATACCTTTTCAAAAC 312 0 
QUERY: 3904 AGATTTTGTTTTCCTACCTGTGGCCCATCCACTCCTCTCTGGTACCCCATCCCCGCATCA 3 963 

MIIMIMMIIIMIIIIIMMMIIIMMIMIIIMIIIIIIIIMMIMIM 

SBJCT: 3121 AGATTTTGTTTTCCTACCTGTGGCCCATCCACTCCTCTCTGGTACCCCATCCCCGCATCA 3180 
QUERY: 3 964 GCACTGCAGAGAGAACACATTTCGGCGAGGGTTTTCTTACCCACATTCCCCAATCAATAC 4023 

MIIIIMMIIMMIIMIMMIIIIIMMIIIIMIIIIMIIIMMMIIIM 

SBJCT: 3181 GCACTGCAGAGAGAACACATTTCGGCGAGGGTTTTCTTACCCACATTCCCCAATCAATAC 3240 
QUERY: 4024 ACACACACTGCAGAACCCAGAACAGAAGGCCACAGGCTGGCACTACTGCATTCTCCTTAT 4083 

IIIIMMIMfllllllllMllllllllMlllllllllMIIIMIIIIIIIIIlll 

SBJCT: 3241 AC^C^CACTGCAGAACCCAGAACAGAAGGCCACAGGCTGGCACTACTGCATTCTCCTTAT 3300 
QUERY: 4084 GTGTCTCAGGCTGTGGTGACTCTCACATGGGCATCGAAGAAGTACAACCCACATAGCCCT 4143 

IIIMIIIIIIIMIIMIIMIIMIIIMIIIIIIMMIMMMIIIIIMIMII 

SBJCT: 3301 GTGTCTCAGGCTGTGGTGACTCTCACATGGGCATCGAAGAAGTACAACCCACATAGCCCT 3360 
QUERY: 4144 CTGGAGACCGCCTAGATCAGAGACTCAGCAAAAACAGGCTCGCCTTCCCTCTCCCACATA 4203 

IIIIIMIIMMIIIMIMIIIIIIIMIIIIIIIMIIIMMIIMIIIIMIIII 

SBJCT: 33 61 CTGGAGACCGCCTAGATC^GAGACTCAGCAAAAACAGGCTCGCCTTCCCTCTCCCACATA 3420 
QUERY: 4204 TGAGTGGAACTTACATGTGTCCTGGTTTGAATGATC^TTTTGCAAGCCACACGGGTTGGG 4263 

MMlMMIIIMMIIIMMIIIMMMIMIIllllllMMIIIIMMMMl 

SBJCT: 3421 TGAGTGGAACTTACATGTGTCCTGGTTTGAATGATCATTTTGCAAGCCACACGGGTTGGG 34 80 
QUERY: 4264 AGAGGTGGTCTCACCACAGACGTCTTTGCTAATTTGGCCACCTTCACCTACTGACATGAC 4323 

IIMIllllllMIIIMMIllllMMMIHIIMMIIMIIIIMIIIlllMIl 
SBJCT: 3481 AGAGGTGGTCTCACCACAGACGTCTTTGCTAATTTGGCCACCTTCACCTACTGACATGAC 3540



QUERY : 4324 CAGGATTTTCCTTTGCCATTAAGGAATGAACTCTTTCAAGGAGAGGAAACCCTAGACTCT 43 83 

IIIMillllllllllllllllllllillllllllllllllllMIIMMIillllMI 

SBJCT: 3 541 CAGGATTTTCCTTTGCCATTAAGGAATGAACTCTTTCAAGGAGAGGAAACCCTAGACTCT 3600 
QUERY * 43 84 GTGTCACTCTCAACACACACAGCTCCTTTCACTCCTGCCTGACTGCCAAGCCACCTGCAT 4443 

MMMIIMllMIIIIMMIlllMIMMIIillMIIMMIIIIIMIIMIM 

SBJCT: 3601 GTGTCACTCTCAACACACACAGCTCCTTTCACTCCTGCCTGACTGCCAAGCCACCTGCAT 3660 
QUERY: 4444 CCCCCGCCCCAGATCTCATGAGATCAATCACTTGTATGTCTCACGCAACTTGGTCCACCA 4503 

MIIMIiMIMIMIIMIMMIIMIllMMIMMIIIIMIIIMIIIIMII 

SBJCT: 3661 CCCCCGCCCCAGATCTCATGAGATCAATCACTTGTATGTCTCACGCAACTTGGTCCACCA 372 0 



15 QUERY: 4504 AACGCCTGTCCCCTGTAACTCCTAGGGGTGCGCCTAGACAGGTACGTCTGTTTTTTATTT 4563 

IIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIMIIIII 

SBJCT : 3 721 AACGCCTGTCCCCTGTAACTCCTAGGGGTGCGCCTAGACAGGTACGTCTGTTTTTTATTT 378 0 
QUERY: 4564 TAAAAGATATGCTATGTAGATATAAGTTGAGGAAGCTCACCTCAAAAGCCTAGAATGCAG 4623 

20 II 1 1 ! I ! II 1 1 1 1 1 1 1 1 1 M M M 1 1 1 1 1 1 1 1 M ! I M M 1 1 1 i 1 1 1 1 M I II M 1 1 1 1 1 

SBJCT: 3781 TAAAAGATATGCTATGTAGATATAAGTTGAGGAAGCTCACCTCAAAAGCCTAGAATGCAG 3 840 
QUERY: 4624 TTTC^CAGTAGCTGGGATGCATGGATGACCCATCTCACCCCI^^ 4683 

IIIIIIIIIIIIMIIIIIIIIIIIIIIIIIMIIIIIIII 

25 SBJCT: 3841 TTTCACAGTAGCTGGGATGCATGGATGACCCATCTCACCCCTTTTTTTTTCCTGCCTCAA 3900 
QUERY: 4684 TATCTTGATATGTTATGTTTACTCCCAATCTCCCATTTTTACCACTAAAATTCTCCAACT 4743 

I [ 1 1 1 1 1 1 1 i 1 1 1 1 r 1 1 1 i 1 1 1 1 1 1 i 1 1 1 i 1 1 1 E 1 1 i 1 1 i 1 E 1 1 1 1 f I i i 1 1 1 1 1 1 1 1 1 1 

SBJCT: 3 901 TAT CTTGATATGTTATGTTTACT C C CAAT CTCCCATTTT TAC CACTAAAATT CTC CAACT 3 96 0 



QUERY: 4744 TTC^TAAACNISTN^^ 4 803 

IMMIIM MIIMIIIIMIIIMIIIIIllllMIIIIIIMIIIIIII 

SBJCT: 3 961 TTCATAAACTTTTTTTTGGAAAAATTTCCATTGTATCAGCCCCTGACAGAAAAAGGATCT 4020 



35 QUERY: 4 804 CTGAGCCTAAAGGAGGAAAAGTCCCACCAACTACCAGACCAGAACACGAGCCCCTCTGGG 4 863 

MMIIIIIIMMIMMIIMIIIIMIIIIMIIIIIMIIIMIIIIIMIIIMI 

SBJCT: 4 021 CTGAGC CTAAAGGAGGAAAAGTCCCACCAACTACCAGACCAGAACACGAGCCCCT CTGGG 4080 
QUERY: 4 864 CAGCAGGATTCCTAAGTCAAAGACCAGTTTGACCCAAACTGGCCTTTTAAAATAATCAGG 4923 

40 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II I II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

SBJCT: 4 0 81 CAGCAGGATTCCTAAGTCAAAGACCAGTTTGACCCAAACTGGCCTTTTAAAATAATCAGG 4140 
QUERY: 4924 AGTGACAGAGTCAACTTCTGCAGCACCTGCTTCTCCCCCACTGTCCCTTCCATCTTGGAA 4983 

MlliMIIIIIIIIIIIIIIIIMIMMIMIIMIilMIIIIIIIIMMIIIlll 

45 SBJCT: 4141 AGTGACAGAGTCAACTTCTGCAGCACCTGCTTCTCCCCCACTGTCCCTTCCATCTTGGAA 4200 
QUERY: 4 984 TGTGTCTAAAAAAGCATAGCTGCCCTTTGCTGTCCTCAGAGTGCATTTCCTGGAGA CGGC 5043 

IMIIIIMIIIIIIIMIIIIIIIIIIIMIMIIIIIIMMIIMIIIIIIIIIMI 

SBJCT: 4201 TGTGTCTAAAAAAGCATAGCTGCCCTTTGCTGTCCTCAGAGTGCATTTCCTGGAGACGGC 4260 



QUERY: 5044 AGGCTTAGGTCTCACTGACAGCATGCCAGACACAACTGAATCGAAGCAGGCCTGAAGCCT 5103 

1 1 1 i I E I i I f 1 1 1 1 1 1 r 1 1 1 i 1 1 1 1 1 i i E 1 1 i E 1 1 1 1 f t E 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i I i 1 f 1 

SBJCT: 4261 AGGCTTAGGTCTCACTGACAGCATGCCAGACACAACTGAATCGAAGCAGGCCTGAAGCCT 4320 



55 QUERY: 5104 AGGTCAGGGTTTCAGGAGTCCAGCCC CAGGAGGCAAAGT CACCAATGCAGGGAGGTAAAT 5163 

IIIMIillllllMIIIIMIIIIIIIIIIIIIIMIIIIIIIMIIIIIIIIMIIII 

SBJCT: 4321 AGGTCAGGGTTTCAGGAGTCCAGCCCCAGGAGGCAAAGTCACCAATGCAGGGAGGTAAAT 4380 
QUERY: 5164 GCCTTTTGGCAGGAAAACCAATAGAGTTGGTTGGGTGGGGAGTCAGGGGTGGGAGGAGAA 5223 

60 1 1| 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1| 1 1 1 1| 1 1| 1 1 i 1 1 1 1 1 1 1 1| 1 1 1 1 1 1 1 1 1 1 1 1 1| 

SBJCT : 4381 GCCTTTTGGCAGGAAAACCAATAGAGTTGGTTGGGTGGGGAGTCAGGGGTGGGAGGAGAA 4440 
QUERY: 5224 GGAGGAAGAGGAGGAAGGCCAGACTGGCCTGCCCTTTCTCCCATACTTCACCCCAGCAGA 5283 

1 1 1 1 i II 1 1 1 1 II 1 1 M 1 1 1 1 i I II 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 II 1 1 1 1 M 1 1 1 1 1 

65 SBJCT: 4441 GGAGGAAGAGGAGGAAGG C CAGACTGGC CTGC CC TTT CT CC C ATACT TCAC C CCAGCAGA 4 500 
QUERY: 52 84 GGTTCATGGGACACAGTTGGAAAGC CAC TGGGAGGAAATG C CT CAC TACAGGGGGG C CT C 5343 

IMIMIIIIIIIIIIIMMMIIIIIIIMMIIIIIMMMIMIIIMIIIIIM 

SBJCT: 4501 GGTTCATGGGACACAGTTGGAAAGC CAC TGGGAGGAAATGCCTCACTACAGGGGGGCCTC 4560 



QUERY : 


5344 


SB JCT : 


4561 






SB JCT : 


4621 


QUERY : 


5464 


SB JCT : 


4681 



lllIlllillllMllllllMIMMMIMlMMIiMMMMIIIMIIIIII 

aTAGC^GCCC^GCCGGTAATCCTCCTAATGAACCC^CAAGGTCAATTC^CAACTGAT 462 0 
CTTAGCTATTAAAGAAGTACTGACTTTACCAAAAGAATCATCAAGAAAGCTATTTATA 5463 

E 1 1 i 1 1 1 1 1 1 i I E 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! i i i I i 1 1 1 1 1 i 1 1 1 1 1 f r M i 1 1 1 f 1 1 

CTTAGCTATTAAAGAAGTACTGACTTTACCAAAAGAATCATCAAGAAAGCTATTTATA 4680 
kACCCCCTCAGTCATTTTGAAATAAAATTAATTTTAC 5502 

10 MIIIMMIMIIMIlMliMMlllllllMIII! 

4681 TAAACCC CCTCAGTCATTTTGAAATAAAATTAATTTTAC 4719 

The FCTR2 amino acid sequence has 473 of 810 amino acid residues (58%) identical to,
and 616 of 810 residues (76%) positive with, the 850 amino acid residue KIAA1263 protein
fragment from Homo sapiens (ptnr: TREMBLNEW-ACC:BAA86577) (SEQ ID NO:47)
(Table 2D).
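
By way of illustration only, the identity and positive counts of a BLASTP summary line can be extracted and the percentages recomputed. The regular expression and function name are hypothetical and assume a summary line of the form shown in the example string; rounding to the nearest whole percent is likewise an assumption about how the reported values were obtained.

```python
import re

# Matches "Identities = 473/810" or "Positives = 616/810" in a summary line.
SUMMARY_FIELD = re.compile(r"(Identities|Positives)\s*=\s*(\d+)/(\d+)")


def parse_summary(line):
    """Return {label: (count, aligned_length, rounded_percent)} for one summary line."""
    result = {}
    for label, count, total in SUMMARY_FIELD.findall(line):
        count, total = int(count), int(total)
        result[label] = (count, total, round(100 * count / total))
    return result


summary = parse_summary("Identities = 473/810 (58%), Positives = 616/810 (76%)")
print(summary)
```

The recomputed percentages can then be compared with the values printed in parentheses in the summary line.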

Table 2D. BLASTP of FCTR2 against Homo sapiens KIAA1263 Protein fragment (SEQ ID NO:47)

ptnr:TREMBLNEW-ACC:BAA86577 KIAA1263 PROTEIN - Homo sapiens (Human), 850 aa (fragment)

Length = 850

Score = 2573 (905.7 bits), Expect = 2.0e-267, P = 2.0e-267
Identities = 473/810 (58%), Positives = 616/810 (76%)



QUERY : 


10 


SB JCT : 


40 


QUERY : 


70 


SB JCT : 


100 


QUERY: 


130 


SB JCT : 


160 


QUERY : 


189 


SB JCT : 


219 


QUERY : 


249 


SB JCT: 


279 


QUERY: 


309 


SB JCT : 


339 


QUERY : 


369 


SB JCT : 


399 


QUERY : 


429 


SB JCT : 


459 


QUERY : 


489 


SB JCT : 


519 


QUERY : 


549 



III + I ++ II l+l I M I I + M+ I M++ 1 + 

RLRHKEKNQESSRVKGFMIQDGPFGSCENKYCGLGRHCVTSRETGQAECACMDLCKF 

PVCGSDGRFYENHCKLHR7AACLLGKRITVIHSKDCFLKGDTCTMAGYARLKNVLLM 
IHIIII IIIHI++IIMII ++ || ++ | ++ ||| mi i I+++M + N I 

PVCGSDGEFYENHCEVHRAACLKKQKITIVHNEDCFFKGDKCKTTEYSKMKNMLLDI 

LQPLQEGDSRQ-DPASQKRLLVESLFRDLDADGNGHLSSSELAQHVLKKQDLDEDLI 

35 + +|| ++ | | + | + | 1 1+ + 1 + Mill + +11 I I+I+++I +1 1 

QKYIMQENENPNGDDISRKKLLVDQMFKYFDADSNGLVDINELTQ- VIKQEELGKDI 
SPGDLLRFDDYNSDSSLTLREFYMAFQWQLSLAPEDRVSVTTVTVGLSTVLTCAVE 

.+ M++II + I + I I I III 1 1 1 l + l 1 1 1 + + II III Ml 11 + 

40 

QUERY: 249 LRL _ _ 

llllllll + II llllllllll+l Mlllllll l+IIMI+l I+II++III+ 
PPIIWKRNNIILNNLDLEDINDFGDDGSLYITKVTTTHVGNYTCYADGYEQVYQTHII 

NT^PPVIRVYPESQAQEPGVAASLRCHTAEGIPMPRITWLIOSrGVDVSTQMSKQLSLLANC 

IIMIIIIIIIIII + IIII IIIIMIIIII I++ IIIII + I++ ++MII + I III 
^PPVIRVYPESQAREPGVTASLRCHAEGIPKPQLGWLKNGIDITPKLSKQLTLQANC 

LHIS S VR YEDTGAYTC I AKNEVGVDED ISSLF I EDSARKTLAN ILWREEGLS VGNMF} 

+ lll + IIMIil!IIIIMII Illlllllll + IIIIIIIIINIIIIIII +II1II 

raiSNTOYEDTGAYTCIAKNEAGVDEDISSLFVEDSARKTLANILWREEGLGIGNMF^ 
SDDGIIVIHPVDCEIQRHLKPTEKIFMSYEEICPQREKHATQPCQWVSAVISIVRNRYI^ 

55 M +111 II I++M III + II + II+ +1 + 11+ I + 

qpiecefqrhikpsekllgfqdevcpkaegdevqrcvwasavnvkdkf: 

wdiqaqkvlqsigvdplpaklsydkshdqvwvlswgdvhksrpslqv; 

+II+I+III+I++ ll+l II IIIIIIIIIIIMI + 1+ l+lll 



- PFAGVDDFFIPPTNLI INHIRFGFIFNKSDPAVHKVDLETMM 603 
II

9 LAS 



SBJCT: 579 LASGNVPHHTIHTQPVGKQFDRVDDFFIPTTTLIITHMRFGFILHKDEAALQKIDLETMS 638

QUERY: 604 PLKTIGLHHHGCVPQAMAHTHLGGYFFIQCRQDSPASAARQLLVDSVTDSVLGPNGDVTG 663
+ 111 1 + INI++MIIIIMI 1+ II + + I++II lllll+l 1 Mil
SBJCT: 639 YIKTINLKDYKCVPQSIAYTHLGGYYFIGCKPDSTGAVSPQVMVDGVTDSVIGFNSDVTG 698

QUERY: 664 TPHTSPDGRFIVSAAADSPWLHVQEITVRGEIQTLYDLQINSGISDLAFQRSFTESNQYN 723
11+ MM ++II + M il+lllll +1+ 1 MMMI IIII++III
SBJCT: 699 TPYVSPDGHYLVSINDVKGLVRVQYITIRGEIQEAFDIYTNLHISDLAFQPSFTEAHQYN 758

QUERY: 724 IYAALHTEPDLLFLELSTGKVGMLKNLKEPPAGPAQPWGGTHRIMRDSGLFGQYLLTPAR 783
M + 1+ l + M + MI + MI i + t + MM M +t ++I 1 1 1 1 1 1 1 l+l I++
SBJCT: 759 IYGSSSTQTDVLFVELSSGKVKMIKSLKEPLKAEEWPWNRKNRQIQDSGLFGQYLMTPSK 818

QUERY: 784 ESLFLINGRQNTLRCEVSGIKGGTTWWVGE 814
+III+++II 1 1 II++ ++ 1 11+111+
SBJCT: 819 DSLFXLDGRLNKLNCEITEVEKGNTVIWVGD 849





Amino acids 123-815 of FCTR2 also have 693 of 693 amino acid residues (100%)
identical to the 693 amino acid residue protein fragment of KIAA1061 Protein from Homo
sapiens (ptnr: TREMBLNEW-ACC:BAA83013) (SEQ ID NO:48) (Table 2E).



Table 2E. BLASTP of FCTR2 against KIAA1061 Protein [Fragment] (SEQ ID NO:48) 

ptnr:TREMBLNEW-ACC:BAA83013 KIAA1061 PROTEIN - Homo sapiens (Human), 693 aa (fragment).

Length = 693

Score = 3623 (1275.4 bits), Expect = 0.0, P = 0.0
Identities = 693/693 (100%), Positives = 693/693 (100%)



QUERY: 123 NVLLALQTRLQPLQEGDSRQDPASQKRLLVESLFRDLDADGNGHLSSSELAQHVLKKQDL 182
           ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
SBJCT:   1 NVLLALQTRLQPLQEGDSRQDPASQKRLLVESLFRDLDADGNGHLSSSELAQHVLKKQDL 60

QUERY: 183 DEDLLGCSPGDLLRFDDYNSDSSLTLREFYMAFQWQLSLAPEDRVSVTTVTVGLSTVLT 242
           ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
SBJCT:  61 DEDLLGCSPGDLLRFDDYNSDSSLTLREFYMAFQWQLSLAPEDRVSVTTVTVGLSTVLT 120

QUERY: 243 CAVHGDLRPPIIWKRNGLTLNFLDLEDINDFGEDDSLYITKVTTIHMGNYTCHASGHEQL 302
           ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
SBJCT: 121 CAVHGDLRPPIIWKRNGLTLNFLDLEDINDFGEDDSLYITKVTTIHMGNYTCHASGHEQL 180

QUERY: 303 FQTHVLQVNVPPVIRVYPESQAQEPGVAASLRCHAEGIPMPRITWLKNGVDVSTQMSKQL 362
           ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
SBJCT: 181 FQTHVLQVNVPPVIRVYPESQAQEPGVAASLRCHAEGIPMPRITWLKNGVDVSTQMSKQL 240

QUERY: 363 SLLANGSELHISSVRYEDTGAYTCIAKNEVGVDEDISSLFIEDSARKTLANILWREEGLS 422
           ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
SBJCT: 241 SLLANGSELHISSVRYEDTGAYTCIAKNEVGVDEDISSLFIEDSARKTLANILWREEGLS 300

QUERY: 423 VGNMFYVFSDDGIIVIHPVDCEIQRHLKPTEKIFMSYEEICPQREKNATQPCQWVSAVNV 482
           ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
SBJCT: 301 VGNMFYVFSDDGIIVIHPVDCEIQRHLKPTEKIFMSYEEICPQREKNATQPCQWVSAVNV 360

QUERY: 483 RNRYIYVAQPALSRVLWDIQAQKVLQSIGVDPLPAKLSYDKSHDQVWVLSWGDVHKSRP 542
           ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
SBJCT: 361 RNRYIYVAQPALSRVLWDIQAQKVLQSIGVDPLPAKLSYDKSHDQVWVLSWGDVHKSRP 420

QUERY: 543 SLQVITEASTGQSQHLIRTPFAGVDDFFIPPTNLIINHIRFGFIFNKSDPAVHKVDLETM 602
           ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
SBJCT: 421 SLQVITEASTGQSQHLIRTPFAGVDDFFIPPTNLIINHIRFGFIFNKSDPAVHKVDLETM 480
QUERY: 603 MPLKTIGLHHHGCVPQAMAHTHLGGYFFIQCRQDSPASAARQLLVDSVTDSVLGPNGDVT 662
           ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
SBJCT: 481 MPLKTIGLHHHGCVPQAMAHTHLGGYFFIQCRQDSPASAARQLLVDSVTDSVLGPNGDVT 540

QUERY: 663 GTPHTSPDGRFIVSAAADSPWLHVQEITVRGEIQTLYDLQINSGISDLAFQRSFTESNQY 722
           ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
SBJCT: 541 GTPHTSPDGRFIVSAAADSPWLHVQEITVRGEIQTLYDLQINSGISDLAFQRSFTESNQY 600

QUERY: 723 NIYAALHTEPDLLFLELSTGKVGMLKNLKEPPAGPAQPWGGTHRIMRDSGLFGQYLLTPA 782
           ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
SBJCT: 601 NIYAALHTEPDLLFLELSTGKVGMLKNLKEPPAGPAQPWGGTHRIMRDSGLFGQYLLTPA 660

QUERY: 783 RESLFLINGRQNTLRCEVSGIKGGTTWWVGEV 815
           |||||||||||||||||||||||||||||||||
SBJCT: 661 RESLFLINGRQNTLRCEVSGIKGGTTWWVGEV 693

The amino acid sequence of the FCTR2 protein has 451 of 772 amino acid residues
(58%) identical to, and 586 of 772 residues (75%) positive with, the 773 amino acid residue
hypothetical protein DKFZp566D234.1 from Homo sapiens (fragments) (ptnr:
SPTREMBL-ACC:CAB70877.1) (SEQ ID NO:49) (Table 2F).
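
The percentages quoted above can be reproduced from the raw counts. The following is a minimal sketch, assuming the reported values are whole-number percentages truncated rather than rounded (an assumption that agrees with the identity and positive figures quoted for this alignment); the function name `blast_percent` is hypothetical and is not part of any BLAST program.

```python
def blast_percent(matches: int, aligned_length: int) -> int:
    """Return matches / aligned_length as a truncated whole-number percentage.

    Assumption: the percentage is truncated toward zero, not rounded.
    """
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    # Integer arithmetic avoids floating-point rounding surprises.
    return (100 * matches) // aligned_length


if __name__ == "__main__":
    # Counts taken from the FCTR2 vs. DKFZp566D234.1 comparison.
    print("identities:", blast_percent(451, 772), "%")
    print("positives: ", blast_percent(586, 772), "%")
```

Truncation is used here only because it matches the figures in this comparison; other alignment reports may round differently.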

Table 2F. BLASTP of FCTR2 against hypothetical protein DKFZp566D234.1 (SEQ ID NO:49)

>GI|11360192|PIR||T46283 HYPOTHETICAL PROTEIN DKFZP566D234.1 - HUMAN (FRAGMENTS)
GI|6808053|EMB|CAB70877.1| (AL137695) HYPOTHETICAL PROTEIN [HOMO SAPIENS]
LENGTH = 773

SCORE = 911 BITS (2354), EXPECT = 0.0
IDENTITIES = 451/772 (58%), POSITIVES = 586/772 (75%), GAPS = 7/772 (0%)



QUERY: 


49 


SBJCT : 


2 


QUERY : 


109 


SBJCT : 


62 


QUERY : 


168 


SBJCT : 


122 


QUERY: 


228 


SBJCT : 


181 


QUERY: 


288 


SBJCT : 


241 


QUERY : 


348 


SBJCT : 


301 


QUERY : 


408 


SBJCT : 


361 


QUERY: 


468 


SBJCT : 


421 



ii II+M+ m i++ i + i iMiiii imii++iiiiii ++II++I++IM 

CVTS RE TGQAE CACMDL C KRH YKP VCGS DGE F YENHC E VHRAACL KKQ K I T I VHNE DC F F 6 1 



+++ || + || || + + || ++ | | + | + ||| + + | + | 
CCSKMKNMLLDLQNQRYIMQENENPNGDDISRKKLLVDQMFKYFDi 

/LKKQDLDEDLLGCSPGDLLRFDDYNSDSSLTLREFYMAFQWQL' 

I+I+++1 +11 1+ M++II + I + I I I III 1111 + 11 



45 1 1+ I MINIMI! + II I IMIMM + I llllllll 

SITAATVGQSAVLSCAIQGTLRPPIIWKRNNIILNNLGLEDINDFGDDGSLYITKVT r 
MGNYTCHASGHEQLFQTHVLQVNVPPVIRVYPESQAQEPGVAASLRCHAEGIPMPRI r 

+ MIII-H klh+llh MMMMMMMM + MM MMIMIMI I++ 

VGNYTCYADGYEQVYQTHI FQVNVPPVIRVYPESQAREPGVTASLRCHAEGI PKPQLC 
KNGVDVSTQMSKQLSLLANGSELHISSVRYEDTGAYTCIAKNEVGVDEDISSLFIEDi 

III + I++ ++MII + I IMIMII + IMMMMMIMM M II II 1 1 1 l + M 



MIIIIMI +1 II II M +111 II I++II IM + M + M+ +1 + 11 

kNILWREEGLGIGNMFYVFYEDGIKVIQPIECEFQRHIKPSEKLLGFQDEVCPIi 
) PCQWVSAVNVRNR Y I YVAQ PALS RVLWD I QAQKVLQS I GVDPL PAKLS YDKSI 

I I IMM++++MMM I IIMI + I + IIM++ ll + l II MM 
QUERY: 528 QWVLSWGDVHKSRPSLQVITEASTGQSQHLIRT PFAGVDDFFIPPTNLI INHIR 582
lillltll + 1+ l + IIMI II Ml I MMIM I III l + l
SBJCT: 481 QVWVLSWGTLEKTSPTLQVITIJ^GNVPHHTIHTQPVGKQFDRVDDFFIPTTTLIITHMR 540

QUERY: 583 FGFIFNKSDPAVHKVDLETMMPLKTIGLHHHGCVPQAMAHTHLGGYFFIQCRQDSPASAA 642
1 1 1 1 +1 + 1+ l + l 1 1 1 1 +111 I + lill++l + IMIII + M 1+ II + +
SBJCT: 541 FGFILHKDEAALQKIDLETMSYIKTINLKDYKCVPQSLAYTHLGGYYFIGCKPDSTGAVS 600

QUERY: 643 RQLLVDSVTDSVLGPNGDVTGTPHTSPDGRFIVSAAADSPWLHVQEITVRGEIQTLYDLQ 702
l-HI M I I l + l I I I I I I 1+ I I I I ++M +1111+11111 +1 +
SBJCT: 601 PQVMVDGVTDSVIGFNSDVTGTPYVSPDGHYLVSINDVKGLVRVQYITIRGEIQEAFDIY 660

QUERY: 703 INSGISDLAFQRSFTESNQYNIYAALHTEPDLLFLELSTGKVGMLKNLKEPPAGPAQPWG 762
I MIM NII++IIMI + 1+ l+ll+lll+lll l+l+llll II
SBJCT: 661 TNLHISDLAFQPSFTEAHQYNIYGSSSTQTDVLFVELSSGKVKMIKSLKEPLKAEEWPWN 720

QUERY: 763 GTHRIMRDSGLFGQYLLTPARESLFLINGRQNTLRCEVSGIKGGTTWWVGE 814
+ 1 ++IMIIIIII + ll+++IM+++il I I ++ I 11+111+
SBJCT: 721 RKNRQIQDSGLFGQYLMTPSKDSLFILDGRLNKLNCEITEVEKGNTVIWVGD 772

The amino acid sequence of the FCTR2 protein has 61 of 194 amino acid residues (31%)
identical to, and 90 of 194 residues (45%) positive with, the 306 amino acid residue protein
Follistatin-Related Protein 1 Precursor from Rattus norvegicus (ptnr: GenBank Acc:Q62632)
(SEQ ID NO:50) (Table 2G).

Table 2G. BLASTP of FCTR2 against Follistatin-Related Protein 1 Precursor from Rattus
norvegicus (SEQ ID NO:50)

>GI|2498392|SP|Q62632|FRP_RAT FOLLISTATIN-RELATED PROTEIN 1 PRECURSOR
GI|1083669|PIR||S51361 FOLLISTATIN-RELATED PROTEIN PRECURSOR - RAT
GI|536900|GB|AAA66063.1| (U06864) FOLLISTATIN-RELATED PROTEIN PRECURSOR [RATTUS NORVEGICUS]
LENGTH = 306

SCORE = 86.4 BITS (213), EXPECT = 1E-15
IDENTITIES = 61/194 (31%), POSITIVES = 90/194 (45%), GAPS = 26/194 (13%)

QUERY: 38 CGKKFCSRGSRCVLSRKTGEPECQCLEACRPSYVPVCGSDGRFYENHCKLHRAACLLGKR 97
I II I I ++ I III M + l l + l 11111+1+ I lll+lll III I +
SBJCT: 29 CANVFCGAGRECAVTEK-GEPTCLCIEQCKPHKRPVCGSNGKTYLNHCELHRDACLTGSK 87

QUERY: 98 ITVIHSKDCFLKGD TCTMAGYARLKNVLLA-LQTRLQPLQEGDSRQDPASQK 148
11+ I I II 1+ ++ 1+ + I I I
SBJCT: 88 IQVDYDGHCKEKKSVSPSASPWCYQANRDELRRRI IQWLEAE IIP DGWFSKGSNY 143

QUERY: 149 RLLVESLFRDLDADGNGHLSSSELAQHVLK KQDLDEDLLGCS PGDLLRF 197
+++ |+ I +1+ II I I I + I + 1+ ++ I I 1 +
SBJCT: 144 SE ILDKYFKSFD-NGDSHLDSSEFLKFVEQNETAVNI TAYPNQENNKLLRGLCVDALIEL 202

QUERY: 198 DDYNSDSSLTLREF 211
I l+l 1+ +11
SBJCT: 203 SDENADWKLS FQEF 216

The amino acid sequence of the FCTR2 protein has 61 of 194 amino acid residues (31%)
identical to, and 89 of 194 residues (45%) positive with, the 306 amino acid residue protein
Follistatin-Related Protein 1 Precursor from Mus musculus (GenBank Acc:Q62356) (SEQ ID
NO:51) (Table 2H).
Table 2H. BLASTP of FCTR2 against Follistatin-Related Protein 1 Precursor from Mus
musculus (SEQ ID NO:51)

>GI|6679871|REF|NP_032073.1| FOLLISTATIN-LIKE [MUS MUSCULUS]
GI|2498391|SP|Q62356|FRP_MOUSE FOLLISTATIN-RELATED PROTEIN 1 PRECURSOR (TGF-BETA-INDUCIBLE PROTEIN TSC-36)
GI|481186|PIR||S38251 FOLLISTATIN-RELATED PROTEIN - MOUSE
GI|349006|GB|AAC37633.1| (M91380) TGF-BETA-INDUCIBLE PROTEIN [MUS MUSCULUS]
LENGTH = 306

SCORE = 85.2 BITS (210), EXPECT = 3E-15
IDENTITIES = 61/194 (31%), POSITIVES = 89/194 (45%), GAPS = 26/194 (13%)



QUERY : 


38 


SBJCT : 


29 


QUERY: 


98 


SBJCT : 


88 


QUERY : 


149 


SBJCT : 


144 


QUERY : 


198 


SBJCT : 


203 



I II I I ++ I III I l + l l + l 11111+1+ I lll+lll Ml I 

CANVFCGAGRECAVTEK-GEPTCLCIEQCKPHKRPVCGSNGKTYLNHCELHRDACLTGi 

ITVIHSKDCFLKGDT CTMAGYARLKNVLLA- LQTRLQPLQEGDSRQDPASC 

I I + I I II 1+ 1+ 1+ + I I I 



+ 



+ ++ 1+ I +1+ II III + I + 1+ ++ I 1+ 

SBJCT: 144 SEILDKYFKSFD-NGDSHLDSSEFLKFVEQNETAINITTYADQENNKLLRSLCVDALIEL 202 



The amino acid sequence of the FCTR2 protein has 63 of 193 amino acid residues (32%)
identical to, and 89 of 193 residues (45%) positive with, the 299 amino acid residue protein
Follistatin-Related Protein from the African Clawed Frog (GenBank Acc:JG0187) (SEQ ID
NO:52) (Table 2I).

Table 2I. BLASTP of FCTR2 against Follistatin-Related Protein from the African Clawed
Frog (SEQ ID NO:52)

>GI|7512162|PIR||JG0187 FOLLISTATIN-RELATED PROTEIN - AFRICAN CLAWED FROG
LENGTH = 299

SCORE = 81.8 BITS (201), EXPECT = 3E-14
IDENTITIES = 63/193 (32%), POSITIVES = 89/193 (45%), GAPS = 25/193 (12%)
KKFCSRGSRCVLSRKTGEPECQCLEACRPS YVPVCGSDGRFYENHCKLHRAACLLGKR 9 7 

II I I ++ I l + l I M 1+ IMIM+ i IIMM III I + 

STVFCGAGRECAVTEK-GDPTCDCIEKCKSHKRPVCGSNGKTYLNHCELHRDACLTGSK 8 6 

i/IHSKDCFLK-GDT CTMAGYARL- KNVLLALQTRLQPLQEGDSRQDPASQK 148 

h I I II I + + + 1+ III + % l I I 

tfDYDGHCKEKTSDTPAAVPVACYQSDRDEMRRRVIHWLQTEITP DGWFSKGSDY 142 

LVES LFRDLDABGNGHLS SS E LAQHVLKKQDL DED LLGCS PGDLLRFD 198 

+++ 1+ I 1 1+ 1 1 l + l I + + I 11+ I 1 + 



l + l 



QUERY : 


38 


SBJCT : 


28 


QUERY: 


98 


SBJCT : 


87 


QUERY: 


149 


SBJCT : 


143 


QUERY: 


199 


SBJCT: 


202 

The amino acid sequence of the FCTR2 protein has 59 of 194 amino acid residues (30%) 
identical to, and 90 of 194 residues (45%) positive with, the 308 amino acid residue protein 
Follistatin-Related Protein 1 Precursor from Homo sapiens (GenBank Acc:Q12841) (SEQ ID 
NO:53) (Table 2J). 

Table 2J. BLASTP of FCTR2 against Follistatin-Related Protein 1 Precursor from Homo
sapiens (SEQ ID NO:53)

>GI|5901956|REF|NP_009016.1| FOLLISTATIN-LIKE 1 [HOMO SAPIENS]
GI|2498390|SP|Q12841|FRP_HUMAN FOLLISTATIN-RELATED PROTEIN 1 PRECURSOR
GI|1082372|PIR||S51362 FOLLISTATIN-RELATED PROTEIN - HUMAN
GI|536898|GB|AAA66062.1| (U06863) FOLLISTATIN-RELATED PROTEIN PRECURSOR [HOMO SAPIENS]
GI|3184393|DBJ|BAA28707.1| (D89937) FOLLISTATIN-RELATED PROTEIN (FRP) [HOMO SAPIENS]
GI|12652619|GB|AAH00055.1|AAH00055 (BC000055) FOLLISTATIN-LIKE 1 [HOMO SAPIENS]
LENGTH = 308

SCORE = 82.9 BITS (204), EXPECT = 1E-14
IDENTITIES = 59/194 (30%), POSITIVES = 90/194 (45%), GAPS = 26/194 (13%)

QUERY: 38 CGKKFCSRGSRCVLSRKTGEPECQCLEACRPSYVPVCGSDGRFYENHCKLHRAACLLGKR 97

I II I I ++ I 1 1 1 I 1 + M + l IMII + I + I iii+ill III I + 

SBJCT: 31 CANVFCGAGRECAVTEK-GEPTCLCIEQCKPHKRPVCGSNGKTYLNHCELHRDACLTGSK 89 

QUERY: 98 ITVIHSKDCFLKGD TCTMAGYARLKNVLLA- LQTRLQPLQEGDSRQDPASQK 148 

11+ II I + |+ + + 1 + + I I I 

SBJCT: 90 IQVDYDGHCKEKKS VS PSAS PWCYQSNRDELRRRI I QWLEAE IIP DGWFSKGSNY 145 

QUERY: 149 RL L VE S L FRDLDADGNGHL S S S E LAQH VL KK QDLDEDLLGCS PGDLLRF 197 

+++ |++ | +|+ | | [ | + | + t+ ++ I I 1+ 

SBJCT: 146 SE I LDKYFKNFD - NGDSRLDSS EFLKFVEQNETAIN ITTYPDQENNKLLRGLCVDAL I EL 204 

QUERY: 198 DDYNSDSSLTLREF 211 

I l+l 1+ +11 

SBJCT: 205 SDENADWKLSFQEF 218 



The amino acid sequence of the FCTR2 protein has 35 of 69 amino acid residues (50%)
identical to, and 45 of 69 residues (64%) positive with, the 315 amino acid residue Flik protein
[Gallus gallus] (EMBL Acc:CAB42968.1) (SEQ ID NO:54) (Table 2K).

Table 2K. BLASTP of FCTR2 against Flik protein [Gallus gallus] (SEQ ID NO:54)

>GI|4837645|EMB|CAB42958.1| (AJ238977) FLIK PROTEIN [GALLUS GALLUS]
LENGTH = 315

SCORE = 79.8 BITS (196), EXPECT = 1E-13
IDENTITIES = 35/69 (50%), POSITIVES = 45/69 (64%), GAPS = 1/69 (1%)

QUERY: 38 CGKKFCSRGSRCVLSRKTGEPECQCLEACRPSYVPVCGSDGRFYENHCKLHRAACLLGKR 97
I II 11+ I ++ I III I l+l l+! MIN + I+ I lll + lll ill I +
SBJCT: 31 CANVFCGRGAECAVTEK-GEPTCLCIEQCKPHGRPVCGSNGKTYLNHCELHRDACLTGSK 89

QUERY: 98 ITVIHSKDC 106
          | | +   |
SBJCT: 90 IQVDYDGHC 98
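
The identity counts reported in these tables can be checked directly from a pair of aligned rows. The following is a minimal sketch using the final block of the Flik alignment above; the helper name `count_identities` is hypothetical. Counting "positives" would additionally require a substitution matrix such as BLOSUM62 and is not attempted here.

```python
from typing import Tuple


def count_identities(query: str, subject: str) -> Tuple[int, int]:
    """Count identical residues between two equal-length aligned rows.

    Gap characters ('-') never count as identities. Returns
    (identical positions, number of aligned columns).
    """
    if len(query) != len(subject):
        raise ValueError("aligned rows must have the same length")
    identical = sum(
        1 for q, s in zip(query, subject) if q == s and q != "-"
    )
    return identical, len(query)


if __name__ == "__main__":
    # Query residues 98-106 and subject residues 90-98 from Table 2K.
    identical, columns = count_identities("ITVIHSKDC", "IQVDYDGHC")
    print(f"{identical}/{columns} identical")
```

Summing such per-block counts over all blocks of an alignment should give the numerator of the IDENTITIES line, provided the rows are free of extraction errors.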

The amino acid sequence of the FCTR2 protein has 49 of 152 amino acid residues (32%)
identical to, and 65 of 152 residues (42%) positive with, a 272-420 amino acid fragment, and 31
of 83 residues (37%) identical to, and 44 of 83 residues (52%) positive with, a 248-329 amino
acid fragment, both of the 1375 amino acid residue Frazzled gene protein [Drosophila
melanogaster] (GenBank Acc:T13822) (SEQ ID NO:55) (Table 2L).

Table 2L. BLASTP of FCTR2 against Frazzled gene protein [Drosophila melanogaster]
(SEQ ID NO:55)

>GI|7511861|PIR||T13822 FRAZZLED GENE PROTEIN - FRUIT FLY (DROSOPHILA MELANOGASTER)
GI|1621115|GB|AAC47314.1| (U71001) FRAZZLED [DROSOPHILA MELANOGASTER]
LENGTH = 1375

SCORE = 69.4 BITS (169), EXPECT = 2E-10
IDENTITIES = 49/152 (32%), POSITIVES = 65/152 (42%), GAPS = 4/152 (2%)

QUERY: 243 CAVHGDLRPPIIWKRNGLTLNFLDLEDINDFGEDDSLYITKVTTIHMGNYTCHASGH-EQ 301
I +| +111 111+ l+l 11+ II 1+ I IM I M +
SBJCT: 272 CVANGVPKPQIKWLRNGMDLDFNDLDSRFSIVGTGSLQISSAEDIDSGNYQCRASNTVDS 331

QUERY: 302 LFQTHVLQVNVPPVIRVYPESQAQEPGVAASLRCHAEGIPMPRITWLKNGVDVSTQMSKQ 361
I +11 II 1+ l+l I I I I Mill ++ I
SBJCT: 332 LDAQATVQVQEPPKFIKAPKDTTAHEKDEPELKCDIWGKPKPVIRWLKNGDLITPNDYMQ 391

QUERY: 362 LSLLANGSELHISSVRYEDTGAYTCIAKNEVG 393
I +1 I I + I I + 1+ I I
SBJCT: 392 LVDGHNLKILGLLNSDAGMFQCVGTNAAG 420

SCORE = 52.9 BITS (126), EXPECT = 1E-05
IDENTITIES = 31/83 (37%), POSITIVES = 44/83 (52%), GAPS = 2/83 (2%)

QUERY: 311 NVPPVIRVYPESQAQEPGVAASLRCHAEGIPMPRITWLKNGVDVS-TQMSKQLSLLANGS 369
+ 11 11+ I +1 I I l + l l + l M + II + I+ + + I++ M
SBJCT: 248 SVAPSFLVGPSPKTVREGDTVTLDCVANGVPKPQIKWLRNGMDLDFNDLDSRFSIVGTGS 307

QUERY: 370 ELHISSVRYEDTGAYTCIAKNEV 392
I III Mill
SBJCT: 308 -LQISSAEDIDSGNYQCRASNTV 329


The amino acid sequence of the FCTR2 protein has 53 of 177 amino acid residues (29%)
identical to, and 78 of 177 residues (43%) positive with, a 366-539 amino acid fragment; 51 of
170 residues (30%) identical to, and 74 of 170 residues (43%) positive with, a 276-438 amino
acid fragment; 46 of 165 amino acid residues (27%) identical to, and 74 of 165 amino acid
residues (43%) positive with, a 185-341 amino acid fragment; 48 of 167 amino acid residues (28%)
identical to, and 70 of 167 amino acid residues (41%) positive with, a 77-243 amino acid
fragment; and 28 of 84 amino acid residues (33%) identical to, and 37 of 84 amino acid residues
(43%) positive with, a 56-139 amino acid fragment, all of the 1395 amino acid residue Roundabout 1
protein [Drosophila melanogaster] (GenBank Acc:AAC38849.1) (SEQ ID NO:56) (Table 2M).

Table 2M. BLASTP of FCTR2 against Roundabout 1 protein [Drosophila melanogaster]
(SEQ ID NO:56)

>GI|2804782|GB|AAC38849.1| (AF040989) ROUNDABOUT 1 [DROSOPHILA MELANOGASTER]
LENGTH = 1395

SCORE = 69.8 BITS (170), EXPECT = 1E-10
IDENTITIES = 53/177 (29%), POSITIVES = 78/177 (43%), GAPS = 11/177 (6%)
QUERY: 243 CAVHGDLRPPIIWKRNGL-TLNFLDLEDINDF-GEDDSLYITKVTTIHMGNYTCHA 296
| |+ I + I + 1+ I I I + + I +1 Ml MM
SBJCT: 366 CMASGNPPPSVFWTKEGVSTLMFPNSSHGRQYVAADGTLQITDVRQEDEGYYVCSAFSVV 425

QUERY: 297 --SGHEQLFQTHVLQVNVPPVIRVYPESQAQEPGVAASLRCHAEGIPMPRITWLKNGVDV 354
I I + M + I++ Ml I l + M II I I II I + 1 I
SBJCT: 426 DSSTVRVFLQVSSVDERPPPIIQIGPANQTLPKGSVATLPCRATGNPSPRIKWFHDGHAV 485

QUERY: 355 STQMSKQLSLLANGSELHISSVRYEDTGAYTCIAKNEVGVDEDISSLFIEDSARKTL 411
I + I++ MM ++ l + M I M M ++I +1 +1
SBJCT: 486 --QAGNRYSII-QGSSLRVDDLQLSDSGTYTCTASGERGETSWAATLTVEKPGSTSL 539

SCORE = 56.3 BITS (135), EXPECT = 1E-06
IDENTITIES = 51/170 (30%), POSITIVES = 74/170 (43%), GAPS = 12/170 (7%)

QUERY * 243 CAVHGDLRPPIIWKRNGLTLNFLDL^ 301 

Ml II I ++M + + ++I + M l + + l M I I + I 

SBJCT: 276 CSVGGDPPPKVLWKKEEGNIPVSRARILHD EKSLEISNITPTDEGTYVCEAHNNVGQ 332 





QUERY: 302 LFQTHVLQVNVPPVIRVYPESQAQEPGVAASLRCHAEGIPMPRITWLKNGVDVSTQM 358 

I |+ M I ++ - I II II M II I Ml I 

SBJCT: 333 I SARAS L I VHAP PNFT KR P SNKKVG LNGWQ L P CMAS GNP P P S VF WTKEG - - V S TLMF PN 390 



25 QUERY: 359 -SKQLSLLANGSELHISSVRYEDTGAYTCIAKNEVGVDEDISSLFIEDSA 407 

I +1 I |+ II II I I I M II + I+ + 1+ 

SBJCT: 391 SSHGRQYVAADGTLQITDVRQEDEGYYVCSAFSV- -VDSSTVRVFLQVSS 438 

SCORE = 51.7 BITS (123), EXPECT = 3E-05
IDENTITIES = 46/165 (27%), POSITIVES = 74/165 (43%), GAPS = 20/165 (12%)



QUERY : 


251 


SBJCT : 


185 


QUERY : 


299 


SBJCT : 


242 


QUERY: 


359 


SBJCT : 


300 


SCORE 


= 44 



++ l+ 1+ 11+ ++ II + +1 1+ I I III I 

tKDGVPLD--DLKAMS-FGASSRVRIVDGGNLLISNVEPIDEGNYKC: 

JTHVLQVNVPPVIRVYPESQAQEPGVAASLRCHAEGIPMPRITWLKNf 
I 1+ I I 1+ I I I I++ I I 
IDENTITIES = 48/167 (28%), POSITIVES = 70/167 (41%), GAPS = 13/167 (7%)

i/HGDLRPPIIWKRNGLTLNFLDLEDINDFGEDDSLYITKVTTIHM GNYTCHASG 298 



QUERY: 


243 


SBJCT : 


77 


QUERY : 


299 


SBJCT : 


137 


QUERY : 


355 


SBJCT : 


197 


SCORE 


= 42 



+ + + + + + 



Mh I I I I +111 I + MM1 1 + 



IDENTITIES = 28/84 (33%), POSITIVES = 37/84 (43%), GAPS = 4/84 (4%)



QUERY: 314 PVIRVYPESQAQEPGVAASLRCHAEGIPMPRITWLKNGVDVSTQMSKQLSLIANGSELH- 372 

II +1 + l + M M I M M + l III I + I 

SBJCT: 56 PRIIEHPTDLWKKNEPATLNCKVEGKPEPTIEWFKDGEPVSTNEKKSHRVQFKDGALFF 115 

QUERY: 3 73 1 SS VRYEDTGAYTC IAKNEVG 3 93 

+ + +1 II l+IIMI 
SBJCT: 116 YRTMQGKKEQDGGEYWCVAKNRVG 139 
The amino acid sequence of the FCTR2 protein has 55 of 157 amino acid residues
(35%) identical to, and 75 of 157 residues (47%) positive with, a 620-775 amino acid fragment;
49 of 163 residues (30%) identical to, and 71 of 163 residues (43%) positive with, a 335-492
amino acid fragment; 32 of 85 amino acid residues (37%) identical to, and 48 of 85 amino acid
residues (55%) positive with, a 1305-1388 amino acid fragment; 37 of 143 amino acid residues
(25%) identical to, and 60 of 143 amino acid residues (41%) positive with, a 183-319 amino acid
fragment; 43 of 174 amino acid residues (24%) identical to, and 70 of 174 amino acid residues (39%)
positive with, a 711-884 amino acid fragment; and 46 of 165 residues (27%) identical to, and 69
of 165 residues (40%) positive with, a 525-686 amino acid fragment, all of the 2012 amino acid residue
Down Syndrome Cell Adhesion Molecule Precursor (CHD2) from Homo sapiens
(GenBank Acc:O60469) (SEQ ID NO:57) (Table 2N).

Table 2N. BLASTP of FCTR2 against Down Syndrome Cell Adhesion Molecule Precursor
(SEQ ID NO:57)

>GI|12643619|SP|O60469|DSCA_HUMAN DOWN SYNDROME CELL ADHESION MOLECULE PRECURSOR (CHD2)
GI|6740013|GB|AAF27525.1|AF217525_1 (AF217525) DOWN SYNDROME CELL ADHESION MOLECULE [HOMO SAPIENS]
LENGTH = 2012




SCORE =70.6 BITS (172), EXPECT = 6E-11 

IDENTITIES = 55/157 (35%), POSITIVES = 75/157 (47%), GAPS = 7/157 (4%) 



QUERY: 245 VHGDLRP P 1 1 W KRNG LT LNFLDLE D I ND FGE DD S L Y I T KVTT I HMGNYTCHASGHEQL FQ 304 

25 I 111 I l+H + I++ II I+++ +1 Mill I 

SBJCT: 620 VSGDLP I TI TWQKDGRP I PGSLGVT I DNIDFTS SLR I SNLSLMHNGNYTC I ARNEAAAVE 679 

QUERY: 305 THV-LQVNVPPVIRVYPESQAQEPGVAASLRCHAEGIPMPRITW-LKNGVDVST QM 358 

I I IN I ! I I i I I I I i 1 + 1 I I II 

30 SBJCT: 680 HQSQLIVRVPPKFWQPRDQDGIYGKAVILNCSAEGYPVPTIVWKFSKGAGVPQFQPIAL 739 

QUERY: 359 SKQLSLLANGSELH IS S VRYEDTGAYTC I AKNE VGVD 395 

+ ++ +I+III I I I IN I I Ml I 

SBJCT: 740 NGRIQVLSNGS - LLI KHWEEDSGYYLCKVSNDVGAD 775 



SCORE = 50.6 BITS (120), EXPECT = 7E-05 
IDENTITIES = 49/163 (30%), POSITIVES = 71/163 (43%), GAPS = 16/163 (9%) 



QUERY: 243 CAVHGDLRPPIIWKRNGLTLNFLDLEDINDFGEDDSLYITKVTTIHMGNYTCHASGHEQL 302 
40 M| + | M| || | ++ I + + Ml 

SBJCT: 335 CSVTGTEDQELSWYRNGEILNPGKNVRITGINHEN-LIMDHMVKSDGGAYQCFVRKDKLS 393 

QUERY: 303 FQTH VLQVNVPP VI RVYPESQAQEPGVAASLRCHAEG I PMPRI TW LKNGV 352 

1+ 11+ I +1 + I + I II 1+ +1 M III Ml 

45 SBJCT: 394 AQDYVQWLEDGTPKIISAFSE-KWSPAEPVSLMCNVKGTPLPTITWTLDDDPILKGG- 451 

QUERY: 353 DVSTQMSKQLSLLAN - GSELHI SS VRYEDTGAYTC I AKNE VGV 394 

I ++I + ++ I 11+111+ MINI II 

SBJCT: 452 - -SHRISQMITSEGNWSYLNISSSQVRDGGVYRCTANNSAGV 492 



SCORE = 47.9 BITS (113), EXPECT = 5E-04 
IDENTITIES = 32/85 (37%), POSITIVES = 48/85 (55%), GAPS = 6/85 (7%) 



QUERY: 333 LRCHAEGIPMPRITWLK- -NGVDVSTQMSKQLSLLANGSELHISSVRYEDTGAYTCIAKN 390 

55 II I I I I + l + l II + + 1+ +111 + I +1+ M + l l + lll I 

SBJCT: 1305 LPCKAVGDPSPAVKWMKDSNGTPSLVTIDGRRSIFSNGSFI- IRTVKAEDSGYYSCIANN 1363 
QUERY: 391 EVGVDEDISSLFIE DSARKTLA 412

I M I +1 I I |h 

SBJCT: 1364 NWGSDEI ILNLQVQVPPDQPRLTVS 1388 

SCORE = 42.9 BITS (100), EXPECT = 0.015
IDENTITIES = 37/143 (25%), POSITIVES = 60/143 (41%), GAPS = 6/143 (4%) 

QUERY: 270 INDFGEDDSLYITKVTTIHMGNYTCHASGHEQLPQTHVLQVNVPPVIRVYPESQAQEPGV 329 

I I +1 II + I I I +11 + III + + I 

SBJCT: 183 IKDVQNEDGLYNYRCITRHRYTGETRQSNSARLFVSD- -PANSAPSILDGFDHRKAMAGQ 240 

QUERY: 330 AASLRCHAEGIPMPRITWLKNGVDVSTQMSKQLSLIjANGSELHISSVRYEDTGAYTCIAK 389 

I I I I I I 111+ ++ ++I + III ++I l+l+l I 

SBJCT: 241 RVELPCKALGHPEPDYRWLKD- -NMPLELSGRFQKTVTG- -LLIENIRPSDSGSYVCEVS 296 

QUERY: 390 NEVGVDEDISSLFIEDSARKTLA 412 

I I + I I+++ + I++ 
SBJCT: 297 NRYGTAKVTGRLYVKQPLKATIS 319 

SCORE =41.3 BITS (96), EXPECT = 0.047 

IDENTITIES = 43/174 (24%), POSITIVES = 70/174 (39%), GAPS = 11/174 (6%) 



QUERY : 


243 


SBJCT : 


711 


QUERY : 


299 


SBJCT : 


771 


QUERY : 


357 


SBJCT : 


831 


SCORE 


= 40 



+1 +1 II + 



JELHISSVRY EDTGAYTCI AKNEVGVDED I S SLFI ED 405 

I II+++ ll+l ++I I I II I I +++ 



IDENTITIES = 46/165 (27%), POSITIVES = 69/165 (40%), GAPS = 7/165 (4%) 

QUERY: 243 CAVHGDLRPPIIWKRNGLTLNFLDLEDINDFGEDDSLYITKVTT- IHMGNYTCHASGHEQ 301 

Ml ll+l II + I + +1 ++ I + I 111+ I 

SBJCT: 525 CRVIGYPYYSIKWYKNSNLLPFNHRQVA- - FENNGTLKLSDVQKEVDEGEYTCNVLVQPQ 582 

QUERY: 3 02 LFQTHVLQVN- -VPPVIRVYPESQAQEPGVAASLRCHAEGIPMP-RITWLKNGVDVSTQM 358 

I + + I III 1+ + I I +1 +1 III l+l + + 

SBJCT: 583 LSTSQSVHVTVKVPPFIQPF-EFPRFSIGQRVFIPCWVSGDLPITITWQKDGRPIPGSL 641 

QUERY: 3 59 SKQLSLLANGSELHISSVRYEDTGAYTCIAKNEVGVDEDISSLFI 4 03 

+ + II II++ I lllll + M 111 + 

SBJCT: 642 GVTIDNIDFTSSLRISNLSLMHNGNYTCIARNEAAAVEHQSQLIV 6 86 

The amino acid sequence of the FCTR2 protein has 55 of 194 amino acid residues (28%)
identical to, and 86 of 194 residues (44%) positive with, Limbic System-Associated Membrane
Protein Precursor (LSAMP) from Homo sapiens (SWISSPROT Acc:Q13449) (SEQ ID NO:58)
(Table 2O).

Table 2O. BLASTP of FCTR2 against Limbic System-Associated Membrane Protein
Precursor (SEQ ID NO:58)

PTNR:SWISSPROT-ACC:Q13449 LIMBIC SYSTEM-ASSOCIATED MEMBRANE PROTEIN PRECURSOR
(LSAMP) - HOMO SAPIENS (HUMAN), 338 AA.

LENGTH = 338

SCORE = 191 (67.2 BITS), EXPECT = 6.7E-12, P = 6.7E-12
IDENTITIES = 55/194 (28%), POSITIVES = 86/194 (44%)
The amino acid sequence of the FCTR2 protein has 68 of 190 amino acid residues (35%)
identical to, and 92 of 190 residues (48%) positive with, Putative Neuronal Cell Adhesion
Molecule, Short Form from Mus musculus (SPTREMBL Acc:O70246) (SEQ ID NO:59) (Table
2P).

Table 2P. BLASTP of FCTR2 against Putative Neuronal Cell Adhesion Molecule, Short
Form from Mus musculus (SEQ ID NO:59)

PTNR:SPTREMBL-ACC:O70246 PUTATIVE NEURONAL CELL ADHESION MOLECULE (PUNC)
(PUTATIVE NEURONAL CELL ADHESION MOLECULE, SHORT FORM) - MUS MUSCULUS
(MOUSE), 793 AA.

LENGTH = 793

SCORE = 203 (71.5 BITS), EXPECT = 7.0E-12, SUM P(2) = 7.0E-12
IDENTITIES = 68/190 (35%), POSITIVES = 92/190 (48%)

The amino acid sequence of the FCTR2 protein has 58 of 199 amino acid residues (29%)
identical to, and 91 of 199 residues (45%) positive with, CHLAMP, G11-Isoform Precursor from
Gallus gallus (SPTREMBL Acc:O02869) (SEQ ID NO:60) (Table 2Q).

Table 2Q. BLASTP of FCTR2 against CHLAMP, G11-Isoform Precursor from Gallus
gallus (SEQ ID NO:60)

PTNR:SPTREMBL-ACC:O02869 CHLAMP, G11-ISOFORM PRECURSOR - GALLUS GALLUS
(CHICKEN), 350 AA.

LENGTH = 350

SCORE = 191 (67.2 BITS), EXPECT = 7.7E-12, P = 7.7E-12
IDENTITIES = 58/199 (29%), POSITIVES = 91/199 (45%)

The amino acid sequence of the FCTR2 protein has 55 of 194 amino acid residues (28%)
identical to, and 86 of 194 residues (44%) positive with, Limbic System-Associated Membrane
Protein Precursor (LSAMP) from Rattus norvegicus (SWISSPROT Acc:Q62813) (SEQ ID
NO:61) (Table 2R).

Table 2R. BLASTP of FCTR2 against Limbic System-Associated Membrane Protein
Precursor (LSAMP) from Rattus norvegicus (SEQ ID NO:61)

PTNR:SWISSPROT-ACC:Q62813 LIMBIC SYSTEM-ASSOCIATED MEMBRANE PROTEIN PRECURSOR
(LSAMP) - RATTUS NORVEGICUS (RAT), 338 AA.
LENGTH = 338

SCORE = 188 (66.2 BITS), EXPECT = 1.5E-11, P = 1.5E-11
IDENTITIES = 55/194 (28%), POSITIVES = 86/194 (44%)
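
Summary lines of the kind shown in these tables are regular enough to be read programmatically. The following is a minimal sketch, assuming each statistic appears as `NAME = hits/total (pct%)`; the pattern and the function name `parse_blast_stats` are illustrative assumptions and do not come from any BLAST toolkit.

```python
import re

# Matches e.g. "IDENTITIES = 55/194 (28%)"; spacing around "=" may vary.
STAT_RE = re.compile(
    r"(IDENTITIES|POSITIVES|GAPS)\s*=\s*(\d+)/(\d+)\s*\((\d+)%\)"
)


def parse_blast_stats(line: str) -> dict:
    """Return {statistic name: (hits, total, reported percent)} for one line."""
    stats = {}
    for name, hits, total, pct in STAT_RE.findall(line.upper()):
        stats[name.lower()] = (int(hits), int(total), int(pct))
    return stats


if __name__ == "__main__":
    line = "IDENTITIES = 55/194 (28%) , POSITIVES = 86/194 (44%)"
    print(parse_blast_stats(line))
```

Lines without a GAPS field, as in the table above, simply yield a dictionary with no `gaps` key.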

FCTR2 protein has similarity to cell adhesion molecules, follistatin, roundabout and
frazzled (see BlastP results). These genes are involved in neuronal development and
reproductive physiology. Frazzled encodes a Drosophila member of the DCC immunoglobulin
subfamily and is required for CNS and motor axon guidance (Cell 87:197-204(1996)).
Characterization of a rat C6 glioma-secreted follistatin-related protein (FRP) and cloning and
sequence of the human homologue is described in Eur. J. Biochem. 225:937-946(1994). This
protein may modulate the action of some growth factors on cell proliferation and differentiation.
FRP binds heparin. The follistatin-related protein is a secreted protein and has one follistatin-like
domain. The cloning and early dorsal axial expression of Flik, a chick follistatin-related gene, and
evidence for its involvement in dorsalization/neural induction are presented in Dev. Biol.
178:327-342(1996). Roundabout controls axon crossing of the CNS midline and defines a novel
subfamily of evolutionarily conserved guidance receptors, as shown in Cell 92:205-215(1998).
cDNA cloning and structural analysis of the human limbic system-associated membrane protein
(LAMP) is described in Gene 170:189-195(1996). LAMP, a protein of the OBCAM family that
contains three immunoglobulin-like C2-type domains, mediates selective neuronal growth and
axon targeting. LAMP contributes to the guidance of developing axons and remodeling of
mature circuits in the limbic system. This protein is essential for normal growth of the
hippocampal mossy fiber projection. LAMP is attached to the membrane by a GPI anchor. It is
expressed on limbic neurons and fiber tracts as well as in single layers of the superior colliculus,
spinal cord and cerebellum. Characterization of the human full-length PTK7 cDNA encoding a
receptor protein tyrosine kinase-like molecule closely related to chick KLG is disclosed in J.
Biochem. 119:235-239(1996). Based upon homology, FCTR2 proteins and each homologous
protein or peptide may share at least some activity.

Functions and therapeutic uses: 
The OMIM gene map has identified the region to which the invention maps (5q21-5q31)
as associated with susceptibility to the following diseases (OMIM IDs are underlined):

• Allergy and asthma
• Hemangioma, capillary infantile
• Schistosoma mansoni infection, susceptibility/resistance to
• Spinocerebellar ataxia
• Bronchial asthma
• Plasmodium falciparum parasitemia, intensity of
• Corneal dystrophy, Groenouw type I, 121900
• Corneal dystrophy, lattice type I, 122200
• Reis-Bucklers corneal dystrophy
• Corneal dystrophy, Avellino type
• Eosinophilia, familial
• Myelodysplastic syndrome
• Myelogenous leukemia, acute
• Cutis laxa, recessive, type I
• Deafness, autosomal dominant nonsyndromic sensorineural, 1
• Contractural arachnodactyly, congenital
• Neonatal alloimmune thrombocytopenia
• Glycoprotein Ia deficiency
• Male infertility
• Charcot-Marie-Tooth neuropathy, demyelinating
• Gardner syndrome
• Adenomatous polyposis coli
• Colorectal cancer
• Desmoid disease, hereditary, 135290
• Turcot syndrome, 276300
• Adenomatous polyposis coli, attenuated
• Colorectal cancer



Therefore the invention is implicated in at least all of the above-mentioned diseases and
may have therapeutic uses for these diseases.

This sequence has similarity to cell adhesion molecules, follistatin, roundabout and frazzled
(see BlastP results). These genes are involved in neuronal development and reproductive
physiology. Therefore the invention is also implicated in disorders such as, or therapeutic uses
for:

• Neurodegenerative disorders, nerve trauma, epilepsy, mental health conditions
• Tissue regeneration in vivo and in vitro
• Female reproductive system disorders and pregnancy

FCTR3 

FCTR3 is an amino acid type II membrane, neurestin-like protein. The FCTR3a nucleic
acid of 1430 nucleotides (also designated 10129612.0.118) is shown in Table 3A. An ORF was
identified beginning with an ATG initiation codon at nucleotides 69-71 and ending with a TAG
codon at nucleotides 1212-1214. A putative untranslated region upstream from the initiation
codon and downstream from the termination codon is underlined in Table 3A, and the start and
stop codons are in bold letters.
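
The ORF coordinates above determine the length of the encoded polypeptide. The following is a minimal sketch of that arithmetic, assuming 1-based inclusive nucleotide positions as used in the text; the function name `orf_protein_length` is hypothetical.

```python
def orf_protein_length(start_first_base: int, stop_last_base: int) -> int:
    """Number of amino acids encoded by an ORF.

    start_first_base: position of the first base of the initiation codon.
    stop_last_base:   position of the last base of the termination codon.
    Both positions are 1-based and inclusive (an assumption of this sketch).
    """
    span = stop_last_base - start_first_base + 1
    if span < 6 or span % 3 != 0:
        raise ValueError("ORF span must be a whole number of codons")
    # Every codon except the stop codon contributes one residue.
    return span // 3 - 1


if __name__ == "__main__":
    # FCTR3a: ATG at nucleotides 69-71, TAG at nucleotides 1212-1214.
    print(orf_protein_length(69, 1214))
```

For the FCTR3a coordinates this gives 381 residues, which agrees with the polypeptide length stated for FCTR3a.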

Table 3A. FCTR3a Nucleotide Sequence (SEQ ID NO:5)

AAAAAAGGCGGGGGGTGGACTTAGCAGTGTAATTTGAGACCGGTGGTAAGGATTGGAGCGAGCTAGAG ATGCTGCACGCTGCTAACA 
AGGGAAGGAAGCCTTCAGCTGAGGCAGGTCGTCCCATTCCACCTACATCCTCGCCTAGTCTCCTCCCATCTGCTCAGCTGCCTAGCT 
CCCATAATCCTCCACCAGTTAGCTGCCAGATGCCATTC 
AATTCTCCCCCAATTCATACCTGCTCAGAGCATGCTC^ 
CGACTCTGAGGCCCCCTCTCCCACCCCCTCACAACCACACGCTO

TGACCAATCGGCGGAGTCAGATCCACGCCCCGGCCCCAGCGCCCAATC 

GCTGGGTGCTAAACAGCAACGTGCCACTGGAGACCCGGCACTTCCTCTTCAAGACCTCCTCGGGGAGCACACCCTTGTTCAGCAGCT 
CTTCCCCGGGATACCCTTTGACCTCAGGAACGGTTTACACGCCCCCGCCCCGCCTGCTGCCCAGGAATACTTTCTCCAGGAAGGCTT
TCAAGCTGAAGAAGCCCTCCAAATACTGCAGCTGGAAA 

TGGCGTATTTCATAGTGCCCTGGTCGTTGAAAAAC^GCAGCATAGACAGTGGTGAAGCAGAAGTTGGTCGGCGGGTAACACAAGAAG 

TCCCACCAGGGGTGTTTTGGAGGTCACAAATTCACATCAGTC^ 
TCTTTGGTGTTTAC^TAAGAAGAGGACTTCCACCATCTCATGCCCAGTATGACTTCATGGAACGTCTGGACGGGAAGGAGAAGTGGA
GTGTGGTTGAGTCTCCCAGGGAACGCCGGAGCATACAGACCTTGGTTCAGAATGAAGCCGTGTTTGTGCAGTACCTGGATGTGGGCC 
TGTGGCATCTGGCCTTCTACAATGATGGAZVAAGACAAAGAGATGGTT^ 
AGAAAAACAflGCTCIAGGGCGCCCaCTGATTTGACATTATGATTCAG 

GTGTCCGGGGTGTGTCACTGTTTCCCAGGATTTCTAGGAGCAGACTGTGCTAAAGACCTTCCTGCCTTGACTTTCTGCAAGACAATC 
10 ATTAATAAAGCTGCTCTGTAAATACTAAAAAAAAAACA 



The FCTR3a polypeptide (SEQ ID NO:6) encoded by SEQ ID NO:5 is 381 amino acid 
residues long and is presented using the one-letter code in Table 3B. 
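
The stated polypeptide length can be cross-checked against the ORF coordinates given above. The following is a minimal sketch in Python, assuming 1-based inclusive nucleotide coordinates and that the termination codon is not translated. The function name `orf_protein_length` is illustrative only and is not part of the original disclosure.

```python
def orf_protein_length(start_first: int, stop_last: int) -> int:
    """Return the number of amino acids encoded by an ORF.

    start_first: 1-based position of the first base of the initiation codon.
    stop_last:   1-based position of the last base of the termination codon.
    The termination codon is counted in the span but not in the protein.
    """
    span = stop_last - start_first + 1
    if span <= 3 or span % 3 != 0:
        raise ValueError("ORF span must be a multiple of 3 and longer than one codon")
    return span // 3 - 1


# FCTR3a: ATG at nucleotides 69-71, TAG at nucleotides 1212-1214.
print(orf_protein_length(69, 1214))  # 381
```

Under these assumptions the 1146-nucleotide span corresponds to 382 codons, that is, 381 residues plus the termination codon.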

Table 3B. Encoded FCTR3a protein sequence (SEQ ID NO:6). 

15 MLHAANKGRKPSAEAGRPIPPTSSPSLLPSAQLPSSH^ 

PNHHSQSTLRPPLPPPHNHTLSHHHSSANSLNRNSLTNRRSQIHAPAPAPISro 
TPLFSSSSPGYPLTSGTWTPPPRLLPRNTFSRKAFKL^ 

RRVTQEVPPGVFWRSQIHISQPQFLKFNISLGKDALFGWIRRGLPPSHAQYDFMSRLDGKEKWSWESPRERRSIQTLVQNEAVFV 
QYLDVGLWHLAFYNDGKDKEMVS FNTWLDGTI 


In an alternative embodiment, the 5' end of the FCTR3a nucleic acid could be extended 
as it is in the 9826 bp FCTR3b (also referred to herein as 10129612.0.405) shown in Table 3C. 
An ORF was identified beginning with an ATG initiation codon at nucleotides 280-282 and 
ending with a TAA codon at nucleotides 8479-8481. Putative untranslated regions upstream 
from the initiation codon and downstream from the termination codon are underlined in Table 3C, 
and the start and stop codons are in bold letters. Italicized bases 1-201 refer to a variable 5' 
region that is discussed further below. 

Table 3C. FCTR3b Nucleotide Sequence (SEQ ID NO:7) 

TTTAAATCCTCATACCTTAAAGGAGATGTGTATATAAGGGAGTTGGAACCAGCATTAGATGAGTTGACAAAAATGCAGTT 
30 TCAGTTCTAGAGGTCTGGGAAGTCCAAGAACAAGGTGCTGGCAGATTGGATTCCCCGTGAGGGCTTTCTTCCTGGCTTGA 
AGTTGGCTGCTTTCCTGCTGAGXCTTCTCATGGCAGAGACTGAGGGTX&C^J&GTG 

CTTTTCTGAAAACATCAGCATTCTGCCATATCTGGAATAA TGGATGTAAAGGACCGGCGACACCGCTCTTTGACCAGAGG 
ACGCTGTGGCAAAGAGTGTCGCTACACAAGCTCCTCTCTGGACAGTGAGGACTGCCGGGTGGCCACACAGAAATCCTACA 
GCTCCAGTGAGACTCTGAAGGCCTATGACCATGACAGCAGGATGCACTATGGAAACCGAGTCACAGACCTCATCCACCGG 

35 GAGTCAGATGAGTTTCCTAGACAAGGAACCAACTTCACCCTTGCCGAACTGGGCATCTGTGAGCCCTCCCCACACCGAAG 
CGGCTACTGCTCCGACATGGGGATCCTTCACCAGGGCTACTCCCTTAGCACAGGGTCTGACGCCGACTCCGACACCGAGG 
GAGGGATGTCTCCAGAACACGCCATCAGACTGTGGGGCAGAGGGATAAAATCCAGGCGCAGTTCCGGCCTGTCCAGTCGT 
GAAAACTCGGCCCTTACCCTGACTGACTCTGACAACGAAAACAAATCAGATGATGAGAACGGTCGTCCCATTCCACCTAC 
ATCCTCGCCTAGTCTCCTCCCATCTGCTCAGCTGCCTAGCTCCCATAATCCTCCACCAGTTAGCTGCCAGATGCCATTGC 

40 TAGACAGCAACACCTCCCATCAAATCATGGACACCAACCCTGATC 
TGCTC^GGGCCCCkGCAAGCCTCC^GC^GTGGCCCTCCGAACC 
CCCTCACAACCACACGCTGTCCC^TCACCACTCGTCCGCC^ 
AGATCCACGCCCCGGCCCCAGCGCCCAATGACCTGGCCACCACACCAG 

AACAGCAACGTGCCACTGGAGACCCGGCACTTCCTCTTCAAGACCTCCTCGGGGAGCACACCCTTGTTCAGCAGCTCTTC 
45 CCCGGGATACCCTTTGACCTCAGGAACGGTTTACACGCCCCCGCCCCGCCTGCTGCCCAGGAATACTTTCTCCAGGAAGG 
CTTTCAAGCTGAAGAAGCCCTCCAAATACTGCAGCTGGAAATGTGCTGCCCTCTCCGCCATTGCCGCGGCCCTCCTCTTG 
GCTATTTTGCTGGCGTATTTCATAGTGCCCTGGTCGTTGAAAAACAGCAGCATAGACAGTGGTGAAGCAGAAGTTGGTCG 
GCGGGTAACACAAGAAGTCCCACCAGGGGTGTTTTGGAGGTCACAAATTCACATCAGTCAGCCCCAGTTCTTAAAGTTCA 
ACATCTCCCTCGGGAAGGACGCTCTCTTTGGTGTTTACATAAGAAGAGGACTTCCACCATCTCATGCCCAGTATGACTTC 
50 ATGGAACGTCTGGACGGGAAGGAGAAGTGGAGTGTGGTTGAGTCTCCCAGGGAACGCCGGAGCATACAGACCTTGGTTCA 
GAATGAAGCCGTGTTTGTGCAGTACCTGGATGTGGGCCTGTGGCATCTGGCCTTCTACAATGATGGAAAAGACAAAGAGA 
TGGTTTCCTTCAATACTGTTGTCCTAGATTCAGTGCAGGACTGTCCACGTAACTGCCATGGGAATGGTGAATGTGTGTCC 
GGGGTGTGTCACTGTTTCCCAGGATTTCTAGGAGCAGACTGTGCTAAAGCTGCCTGCCCTGTCCTGTGCAGTGGGAATGG 
ACAATATTCTAAAGGGACGTGC CAGTGC TACAG CGG CTGGAAAGGTG CAGAGTG CGAC GTG C C C ATGAAT CAGTG CATCG 
ATCCTTCCTGCGGGGGCCACGGCTCCTGCATTGATGGGAACTGTGTCTGCTCTGCTGGCTACAAAGGCGAGCACTGTGAG 
GAAGTTGATTGCTTGGATCCCACCTGCTCCAGCCACGGAGTCTGTGTGAATGGAGAATGCCTGTGCAGCCCTGGCTGGGG 

TGGTCTGAACTGTGAGCTGGCGAGGGTCCAGTGCCCAGACCAGTGCAGTGGGCATGGCACGTACCTGCCTGACACGGGCC 

TCTGCAGCTGCGATCCCAACTGGATGGGTCCCGACTGCTCTGTTGAAGTGTGCTCAGTAGACTGTGGCACTCACGGCGTC 

TGCATCGGGGGAGCCTGCCGCTGTGAAGAGGGCTGGACAGGCGCAGCGTGTGACCAGCGCGTGTGCCACCCCCGCTGCAT 

TGAGCACGGGACCTGTAAAGATGGCAAATGTGAATGCCGAGAGGGGTGGAATGGTGAACACTGCACCATTGGTAGGCAAA 

CGGCAGGCACCGAAACAGATGGC TGC CCTGACTTGTG CAACGGTAACGGGAGATG CACACTGGGT CAGAACAG CTGGCAG 

TGTGTCTGCCAGACCGGCTGGAGAGGGCCCGGATGCAACGTTGCCATGGAAACTTCCTGTGCTGATAACAAGGATAATGA 

GGGAGATGGCCTGGTGGATTGTTTGGACCCTGACTGCTGCCTGCAGTCAGCCTGTCAGAACAGCCTGCTCTGCCGGGGGT 

CCCGGGACCCACTGGACATCATTCAGCAGGGCCAGACGGATTGGCCCGCAGTGAAGTCCTTCTATGACCGTATCAAGCTC 

TTGGCAGGCAAGGATAGCACCC^CATCATTCCTGGAGAGAACCCTTTCAACAGCAGCTTGGTTTCTCTCATCCGA 

AGTAGTAACTACAGATGGAACTCCCCTGGTCGGTGTGAACGTGTCTTTTGTCAAGTACCCAAAATACGGCTACACCATCA 

CCCGCCAGGATGGCACGTTCGACCTGATCGCAAATGGAGGTGCTTCCTTGACTCTACACTTTGAGCGAGCCCCGTTCATG 

AGCCAGGAGCGCACTGTGTGGCTGCCGTGGAACAGCTTTTACGCCATGGACACCCTGGTGATGAAGACCGAGGAGAACTC 

CATCCCCAGCTGTGACCTCAGTGGCTTTGTCCGGCCTGATCCAATCATCATCTCCTCCCCACTGTCCACCTTCTTTAGTG 

CTGCCCCTGGGCAGAATCCCATCGTGCCTGAGACCCAGGTTCTTCATGAAGAAATCGAGCTCCCTGGTTCCAATGTGAAA 

CTTCGCTATCTGAGCTCTAGAACTGCAGGGTACAAGTCACTGCTGAAGATCACCATGACCCAGTCCACAGTGCCCCTGAA 

CCTCATTAGGGTTCACCTGATGGTGGCTGTCGAGGGGCATCTCTTCCAGAAGTCATTCCAGGCTTCTCCCAACCTGGCCT 

CCACCTTCATCTGGGACAAGACAGATGCGTATGGCCAAAGGGTGTATGGACTCTCAGATGCTGTTGTGTCTGTCGGGTTT 

GAATATGAGACCTGTCCCAGTCTAATTCTCTGGGAGAAAAGGACAGCCCTCCTTCAGGGATTCGAGCTGGACCCCTCCAA 

CCTCGGTGGCTGGTCCCTAGACAAACACCACATCCTCAATGTTAAAA 

AGTTCCTGACCCAGCAGCCTGCCATCATCACCAGC^TCATGGGCAATGGTCGCCGCCGGAGCATTTCCTGTCCCAGCTGC 
AACGGCCTTGCTGAAGGCAACAAGCTGCTGGCCCCAGTGGCTCTGGCTGTTGGAATCGATGGGAGCCTCTATGTGGGTGA 
CTTCAAT TACAT CCGACG CATCTTTC CCTCT CGAAATGTGAC CAGCAT CTTGGAGTTACGAAATAAAGAGTTTAAACATA 
GCAAC^CCCAGCACACAAGTACTACTTGGCAGTGGACCCCGTGTCCGGCTCGCTCTACGTGTCCGACACCAACAGCAGG 
AGAATCTACCGCGTCAAGTCTCTGAGTGGAACCAAAGACCTGGCTGGGAATTCGGAAGTTGTGGCAGGGACGGGAGAGCA 
G TGTCTAC CCTTTGATGAAGC C CGCTG CGGGGATGGAGGGAAGG C CATAGATGCAACC CTGATGAGC C CGAGAGGTAT TG 
CAGTAGACAAGAATGGGCTCATGTACTTTGTCGATGCCACCATGATCCGGAAGGTTGACCAGAATGGAATCATCTCOVC^ 
CTGCTGGGCTCCAATGACCTCACTGCCGTCCGGCCGCTGAGCTGTGATTCCAGCATGGATGTAGCCCAGGTTCGTCTGGA 
GTGG C CAACAGAC CTTGCTGT CAATC CCATGGATAACT C CTTGTATGTT CTAGAGAACAATGT CAT C CTTCGAAT CAC CG 
AGAACCACCAAGTCAGCATCATTGCGGGACGCCCCATGCACTG 

GCCATTCACTCTGCCCTGGAGTCAGCCAGTGCCATTGCCATTTCTCACACTGGGGTCCTCTACATCACTGAGACAGATGA 
GAAGAAGATTAACCGTCTACGCCAGGTAAGAACCAACGGGGAGATCTGCCTTTTAGCTGGGGCAGCCTCGGACTGCGACT 
GCAAAAACGATGTCAATTGCAACTGGTATTCAGGAGATGATGGCTACGCGACTGATGCCATCTTGAATTCCCCATCATCC 
TTAGCTGTAGCTCCAGATGGTACCATTTACATTGCAGACCT^ 

TGTTCTTAATGCCTTCAACCAGTATGAGGCTGCATCCCCCGGAGAGCAGGAGTTATATGTTTTCAACGCTGATGGCATCC 
ACCAATACACTGTGAGCCTGGTGACAGGGGAGTACTTGTACAATTTCACATATAGTACTGACAATGATGTCACTGAATTG 
ATTGACAATAATGGGAATTCCCTGAAGATCCGTCGGGACAGC^GTGGCATGCCCCGTCACCTGCTCATGCCTGACAACCA 
GAT CAT CAC C C T CAC CG TGGG CAC C AATGGAGGCC TCAAAGT CGTG TC CACACAGAACC TGGAG CTTGGT CTCATGAC CT 
ATGATGGCAACACTGGGCTCCTGGCCACCAAGAGCGATGAAACAGGATGGACGA 

CGCCTGACCAACGTGACGCGCCCCACGGGGGTGGTAACCAGTCTGCACCGGGAAATGGAGAAATCTATTACCATTGACAT 
TGAGAACTCCAACCGTGATGATGACGTCACTGTCATCACCAACCTCTCTTCAGTAGAGGCCTCCTACACAGTGGTACAAG 
ATCAAGTTCGGAACAGCTACCAGCTCTGTAATAATGGTACCCTGAGGGTGATGTATGCTAATGGGATGGGTATCAGCTTC 
CACAGCGAGCCCC^TGTCCTAGCGGGCACCATCACCCC<^CC^^ 

CTTAAACTCCATTGAGTGGCGCCTAAGAAAGGAACAGATTAAAGGCAAAGTCACCATCTTTGGCAGGAAGCTCCGGGTCC 
ATGGAAGAAATCTCTTGTCCATTGACTATGATCGAAATATTCGGACTGAAAAGATCTATGATGACCACCGGAAGTTCACC 
CTGAGGATC^TTTATGACCAGGTGGGCCGCCCCTTCCTCTGGCTGCCCAGC^GCGGGCTGGC^GCTGTCAACGTGTC^TA 
CTTCTTCAATGGGCGCCTGGCTGGGCTTCAGCGTGGGGCCATGAGCGAGAGGACA^ 

TGTCCCGCATGTTCGCTGACGGGAAAGTGTGGAGCTACTCCTACCTTGAGAAGTCCATGGTCCTCCTGCTTCAGAGCCAA 
CGTCAGTATATATTTGAGTATGACTCCTCTGACCGCCTCCTTGCCGTCACCATGCCC^GCGTGGCCCGGCACAGCATGTC 
CACACACACCTCCATCGGCTACATCCGTAATATTTACAACCCGCCT 

ATGACGGCCGCATCCTGAAGACCTCCTTTTTGGGCACCGGACGCCAGGTGTTCTACAAGTATGGGAAACTCTCCAAGTTA 
T CAGAGATTGT C TACGACAGTAC CGC CGTCAC CT T CGGGTATGACGAGAC CACTGG TGTCT TGAAGATGGT CAAC C T CCA 
AAGTGGGGGCTTCTCCTGCACC^TCAGGTACCGGAAGATTGGCCCCCTGGTGGACAAGCAGATCTACAGGTTCTCCGAGG 
AAGGCATGGTCAATGCGAGGTTTGACTACACCTATCATGAC^ 

GAGACTCCCCTCCCCGTTGACCTCTACCGCTATGATGAGATTTCTGGCAAGGTGGAACACTTTGGTAAGTTTGGAGTCAT 
CTATTATGACATCAACCAGATCATCACCACTGCCGTGA 

AGGTCCAGTATGAGATGTTCCGGTCCCTCATGTACTGGATGACGGTGCAATATGACAGCATGGGCAGGGTGATCAAGAGG 

GAGCTAAAACTGGGGCCCTATGCCAATACCACGAAGTACACCTATGACTACGATGGGGACGGGCAGCTCCAGAGCGTGGC 

CGTCAATGACCGCCCGACCTGGCGCTACAGCTATGACCTTAATGGGAATCTCCACTTACTGAACCCAGGCAACAGTGTGC 

GCCTCATGCCCTTGCGCTATGACCTCCGGGATCGGATAACCAGACTCGGGGATGTGCAGTACAAAATTGACGACGATGGC 

TATCTGTGCGAGAGAGGGTCTGACATCTTCGAATACAATTCCAAGGGCCTCCTAACAAGAGCCTACAA.CAAGGC 

GTGGAGTGTCCAGTACCGCTATGATGGCGTAGGACGGCGGGCTTCCTACAAGACCAACCTGGGCCACCACCTGCAGTACT 

TCTACTCTGACCTCCACAACCCGACGCGCATCACCCATGTCTACAATGACTCCAACTCGGAGATTACCTGACT 

GACCTCC^GGGCCACCTCTTTGCC^TGGAGAGCAGCAGTGGGGAGGAGTACTATGTTGCCTCTGATAACACAGGGACTCC 

TCTGGCTGTGTTCAGCATCAACGGCCTCATGATCAAACAGCTGCAGTACACGGCCTATGGGGAGATTTATTATGACTCCA 

ACCCCGACTTCCAGATGGTCATTGGCTTCCATGGGGGACTCTATGACCCCCTGACCAAGCTGGTCC^CTTCACTCAGCGT 

GATTATGATGTGCTGGC^GGACGATGGACCTCCCCAGACTATACCATGTGGAAAAACGTGGGCAAGGAGCCGGCCCCCTT 

TAACC TGTATATGTTCAAGAG CAACAAT CC T C T CAGCAGTGAG CTAGAT TTGAAGAACTACG TGACAGATGTGAAAAG CT 

GGCTTGTGATGTTTGGATTTCAGCTTAGCAACATCATTCCTGGCTTCCCGAGAGCCAAAATGTATTTCGTGCCTCCTCCC 

TATGAATTGTCAGAGAGTCAAGCAAGTGAGAATGGACAGCTCATT^ 
GGCCTTCATGGCTCTGGAAGGACAGGTCATTACTAAAAAGCTCCACGCCAGCATCCGAGAGAAAGCAGGTCACTGGTTTG 
CCACCACCACGCCCATCATTGGCAAAGGCATCATC 

GCCAGCGAAGATAGCCGCAAGGTGGCATCTGTGCTGAACAACGCCTACT^ 

C^GGACACCCACTACTTTGTGAAGATTGGCTCAGCCGATGGCGACCTGGTCACACTAGGCACCACCATCGGCCGCAAGG 
5 TGCTAGAGAGCGGGGTGAACGTGACCGTGTCCCAGCCCACGCTGCTGGTCAACGGCAGGACTCGAAGGTTCACGAACATT 
GAGTTCCAGTACTCCACGCTGCTGCTCAGCATCCGCTATGGCCTCACCCCCGACACCCTGGACGAAGAGAAGGCCCGCGT 
CCTGGACCAGGCGAGACAGAGGGCCCTGGGCACGGCCTGGGCCAAGGAGCAGCAGAAAGCCAGGGACGGGAGAGAGGGGA 
GCCGCCTGTGGACTGAGGGCGAGAAGCAGCAGCTTCTGAGCACCGGGCGCGTGCAAGGGTACGAGGGATATTACGTGCTT 
CCCGTGGAGCAATACCCAGAGCTTGCAGACAGTAGCAGCAACATCCAGTTTTTAAGACAGAATGAGATGGGAAAGA 

10 ACAAAATAATCTGCTGCCATTCCTTGTCTGAATGGCTCAGCAGGAGTAACTGTTATCTCCTCTCCTAAGGAGATGAAGAC 
CTAACAGGGGCACTGCGGCTGGGCTGCTTTAGGAGACCAAGTGGCAAGAAAGCTCACATTTTTTGAGTTCAAATGCTACT 
GTCCAAGCGAGAAGTCCCTCATCCTGAAGTAGACTAAAGCCCGGGTGAAAATTCCGAGGAAAACAAAACAAACGAATC 
TGAACAGACAC^VCACAATGTTCCAAGTTCCCCTAAAATATGACCCACTTGTTCTGGGTCTACGCAGAAAAGAGACGCAAA 
GTGTCCAAAAGGAACAAAAGAACAAAAACGAATAAGCAAAGAAGAAAA 

15 CCGATAAACAAAGAAGCGAAGATAAGAAAGAAGGCCTCATATCCAATTAC 

ACATCCGCGAGGGCmGCGTCACCAGACCAGCTGCGGGACA?^ACCACTCAGACTGCTTGTAGGACAAATACTTCTGACAT 
TTTCGTTTAAGCAAATACAGGTGCATTTAAAACACGACTTTGGGGGTGATTTGTGTGTAGCGCCTGGGGAGGGGGGATAA 
AAGAGGAGGAGTGAGCACTGGAAATACTTTTTAAAGAAAAAAAAAC^ 

AAGTGAAATAATACCATCCAGCACTTAACTCTCAGGTCCCAACTAAGTCTGGCCTGAGCTAATTTATTTGAGCGCAGAGT 
20 GTAAAATTTAATTCAAAATGGTGGCTATAATCACTACAGATAAATTTCATACTCTTTTGTCTTTGGAGATTCCATTGTGG 
ACAGTAATACGCAGTTACAGGGTGTAGTCTGTTTAGATTCCGTAGTTCGTGGGTATCAGTTTCGGTAGAGGTGCAGCATC 
GTGACACTTTTGCTAACAGGTAGCACTTCTGATCACCCTGTACATACATGAGCCGAAAGGCACAATCACTGTTTCAGATT 
TAAAATTATTAGTGTGTTTGTTTGGTCCAGAAACTGAGACAATCACATGACAGTCACCACGAGGAGAGAAAATTTAAAAA 
ATAAAAATAAAAACAAAAAAAATTTTAAAAATTAAAAAAACAAAAATAAAGT CTAATAAGAACTTTGGTACAGGAACT TT 
25 TTTGTAATATACATGTATGAATTGTTCATCGAGTTTTTATATTAATTTTAATTTGCTGCTAAGCAAAGACTAGGGACAGG 
CAAAGATAATTTATGGCAAAGTGTTTAAATTGTTTATACATAAATAAAGTCTCTAAAACTCCTGTG 



The FCTR3b polypeptide (SEQ ID NO:8) encoded by SEQ ID NO:7 is 2733 amino acid 
residues long and is presented using the one-letter code in Table 3D. The protein has a predicted 
molecular weight of 303424.3 daltons. 

Table 3D. Encoded FCTR3b protein sequence (SEQ ID NO:8). 

MDVKDRRHRSLTRGRCGKECRYTSSS 

CEPSPHRSGYCSDMGILHQGYSLSTGSDADSDTEGGMSPEHAIRLWGRGIKSRRSSGLSSRENSALTLTDSDNSNKSDDENGRPIP 
PTSSPSLLPSAQLPSSHNPPPVSCQMPLLDSNTSHQIMDTNPDEEFSPNSYLLRACSGPQQASSSGPPNHHSQSTLRPPLPPPHNH 
35 TLSHHHSSANSLNRNSLTNRRSQIHAPAPAPNDLATTPESVQLQDSWVLNSNVPLETRHFLFKTSSGSTPLFSSSSPGYPLTSGTV 
YTPPPRLLPRNTFSRKAFKLKKPSKYCSWKCAALSAIAAA^ 

H I SQPQFLKFNI SLGKDALFGVYI RRGLPPSHAQYDFMERLDGKEKWS WES PRERRS I QTLVQNEAVFVQYLDVGLWHLAFYNDG 
KDKEMVSFNTWLDSVQDCPRNCHGNGECVSGVCHCFPGFLGADCAKAACPVLCSGNGQYSKGTCQCYSGVJKGAECDVPMNQCIDP 
SCGGHGSCIDGNCVCSAGYKGEHCEEVDCLDPTCSSHGVCVNGECLCSPGWGGLNCELARVQCPDQCSGHGTYLPDTGLCSCDPNW 

40 MGPDCSVEVCSVDCGTHGVCIGGACRCEEGWTGAACDQRVCHPRCIEHGTCKDGKCECREGWNGEHCTIGRQTAGTETDGCPDLCN 
GNGRCTLGQNSWQCVCQTGWRGPGCNVANETSCADNKDNEGDGLVDCLDPDCCLQSACQNSLLCRGSRDPLDIIQQGQTDWPAVKS 
FYDRIKLLAGKDSTHIIPGENPFNSSLVSLIRGQWTTDGTPLVGVNVSFVKYPKYGYTITRQDGTFDLIANGGASLTLHFERAPF 
MSQERTVWLPWNSFYAMDTLVMKTEENSIPSCDLSGFVRPDPIIISSPLSTFFSAAPGQNPIVPETQVLHEEIELPGSNVKLRYLS 
SRTAGYKS LLKI TMTQSTVPLNL IRVHLMVAVEGHLFQKSFQAS PNLASTFIWDKTDAYGQRVYGLSDAWSVGFEYETCPSLI LW 

45 EKRTALLQGFELDPSNLGGWSLDKHHIIJtfV^^ 

GIDGSLYVGDFNYIRRIFPSRNVTSILELRNKEFKHSN^ 

TGEQCLPFDEARCGDGGKAIDATLMSPRGIAVDKNGLMYFVDATMIRKVDQNGIISTLLGSNDLTAVRPLSCDSSMDVAQTOLEWP 
TDIAWPMDNSLWLEN]WILRITEN^ 

VTTNGE I CLLAGAAS DCDCKNDVNCNCYSGDDAYATDAI LNS P S S LAVAPDGT I YI ADLGNI R IRAVS KNKPVLNAFNQYEAAS PG 
50 EQELYVFNADGIHQYTVSIiVTGEYLYNFTYSTDNDVTELIDNNGNSLKIRRDSSGMPRHLLMPDNQIITLTVGTNGGLKWSTQNL 
ELGLMTYDGNTGLLATKSDETGWTTFYDYDHEGRLTNVTRPTGWTSLHREMEKSITIDIENSNRDD 
DQVRNSYQLCISnsrGTLRVMYANGMGIS^^^ 
SIDYDRNIRTEKIYDDHRKFTLRIIYDQVGRPFLWLPSSGI^^ 

YSYLDKSMVLLLQSQRQYIFEYDSSDRLLAVTMPSVARHSMSTHTSIGYIRNIYNPPESNASVIFDYSDDGRILKTSFLGTGRQVF 
55 YKYGKLSKLSEIWDSTAVTFGYDETTGVLKMVNLQSGGFSCTIRYRKIGPLVDKQIYRFSEEGMVNARFDYTYHDNSFRIASIKP 

VI SETPLPVDLYRYDE ISGKVEHFGKFGVI YYDINQI ITTAVMTLSKHFDTHGRI KEVQYEMFRSLMYWMTVQYDSMGRVI KRELK 

LGPYANTTKYTYDYDGDGQLQSVAVNDRPTWRYSYDLNGNLHLLNPGNSVRLMPLRYDLRDRITRLGDVQYKIDDDGYLCQRGSDI 

FEYNSKGLLTRAYNKASGWSVQYRYDGVGRRASYKTNIiGHHLQYFYSDLHNPTRITHVYNHSNSEITSLYYDL 

EYWASDNTGTPLAVFSINGLMIKQLQYTAYGEIYYDSN^^ 
60 KEPAPFNLYMFKSNNPLSSELDLKNYVTW 

AFMALEGQVITKKLHAS IREKAGHWFATTTP I IGKGIMFAIKEGRVTTGVSS IASEDSRKVASVLNNAYYLDKMHYS IEGKDTHYF 

VKIGSADGDLVTLGTTIGRKVLESGVNVW^ 

WAKEQQKARDGREGSRLWTEGEKQQLLSTGRVQGYEGYYVLPVEQYPELADSSSNIQFLRQNEMGKR 






In further alternative embodiments, the italicized bases at the 5' end of the FCTR3b 
sequence in Table 3C are a variable region. This region can be substituted in other 
embodiments of FCTR3. The 9823 bp FCTR3c nucleotide sequence (also referred to herein as 
10129612.0.154) is the same as that of FCTR3b except that the italicized region 
is replaced with the 201 base sequence shown in Table 3E. An ORF for the total FCTR3c 
nucleotide sequence was identified beginning with an ATG initiation codon at nucleotides 277-280 
and ending with a TAG codon at nucleotides 8473-8475. This is the same open reading 
frame that is shown in Table 3C, with the corresponding base numbers for FCTR3c. This open 
reading frame will translate the same amino acid sequence as shown in Table 3D for FCTR3b. 
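
The substitution of a variable 5' region described above can be illustrated with a short sketch in Python. It uses a toy sequence rather than the disclosed sequences, and the helper names `swap_five_prime` and `shift_coordinate` are hypothetical. The sketch assumes 1-based coordinates and a variable region that lies entirely upstream of the ORF, so that the encoded amino acid sequence is unchanged and only the base numbering of the ORF moves.

```python
def swap_five_prime(sequence: str, region_length: int, replacement: str) -> str:
    """Replace the first `region_length` bases of `sequence` with `replacement`."""
    if not 0 <= region_length <= len(sequence):
        raise ValueError("region_length must lie within the sequence")
    return replacement + sequence[region_length:]


def shift_coordinate(position: int, region_length: int, replacement_length: int) -> int:
    """Map a 1-based position downstream of the variable region to the new numbering."""
    if position <= region_length:
        raise ValueError("position lies inside the replaced region")
    return position + (replacement_length - region_length)


# Toy example: a 4-base variable region (lower case) followed by a short ORF.
toy = "aaaaATGCCCTAA"
variant = swap_five_prime(toy, 4, "gg")
print(variant)                    # ggATGCCCTAA
print(shift_coordinate(5, 4, 2))  # 3, the new position of the A of ATG
```

When the replacement has the same length as the region it replaces, the ORF coordinates are unchanged; otherwise every downstream coordinate shifts by the difference in length.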


Table 3E. Encoded FCTR3c 5'end nucleotide sequence (SEQ ID NO:9). 

GCTCCAAAGCGAGCTGGGACCGAAGACTCTAGGCTAAGTTATCTATGTAGATGGTGTCAGGGAGCGAAGCTACTGACCGA 
GCTGCTGTTACATCCAGCTTTTTAATTGCCTAAGCGGTCTGGGGCTTGCTTCGTCATTTGGCTTTGCTGTGGAGCACTCC 
TGTAAAGCCAGCTGAATTGTACATCGAAGATCCACCCTTTT 


In yet another embodiment, the italicized region shown at the 5' end of the sequence in 
Table 3C can be replaced with the sequence shown in Table 3F to form the 9823 bp FCTR3d (also 
referred to herein as 10129612.0.67). An ORF was identified beginning with an ATG initiation 
codon at nucleotides 277-280 and ending with a TAG codon at nucleotides 8473-8475. This is 
the same open reading frame that is shown in Table 3C, with the corresponding base numbers for 
FCTR3d. This open reading frame will translate the same amino acid sequence as shown in 
Table 3D for FCTR3b. 

Table 3F. Encoded FCTR3d 5'end nucleotide sequence (SEQ ID NO: 10). 

GCTCCAAAGCGAGCTGGGACCGAAGACTCTAGGCTAAGTTATCTATGTAGATGGTGTCAGGGAGCGAAGCTACTGACCGA 
25 GCTGCTGTTACATCCAGCTTTTTAATTGCCTAAGCGGTCTGGGGCTTGCTTCGTCATTTGGCTTTGCTGTGGAGCACTCC 
TGTAAAGCCAGCTGAATTGTACATCGAAGATCCACCCTTTT 

In yet another embodiment, the italicized region shown at the 5' end of the sequence in 
Table 3C can be replaced with the sequence shown in Table 3G to form the 9765 bp FCTR3e (also 
referred to as 10129612.0.258). An ORF was identified beginning with an ATG initiation codon 
at nucleotides 210-212 and ending with a TAG codon at nucleotides 8408-8410. This is the same 
open reading frame that is shown in Table 3C, with the corresponding base numbers for 
FCTR3e. This open reading frame will translate the same amino acid sequence as shown in 
Table 3D for FCTR3b. 

Table 3G. Encoded FCTR3e 5' end nucleotide sequence (SEQ ID NO:11). 

CCAGCATTAGATGAGTTGACAAAAATGCAGTTTCAGCTCT^ 
TTCTGCTACAACTATGACATCCATTTTCTCCCACTTCAGACAGGATGAATACAA 

In yet another embodiment, another FCTR3a homolog, FCTR3f (also referred to as 
10129612.0.352), was found having the 9729 bp sequence shown in Table 3H. An ORF was 
identified beginning with an ATG initiation codon at nucleotides 210-212 and ending with a 
TAG codon at nucleotides 8382-8384. Putative untranslated regions upstream from the 
initiation codon and downstream from the termination codon are underlined in Table 3H, and the 
start and stop codons are in bold letters. 

Table 3H. Encoded FCTR3f nucleotide sequence (SEQ ID NO: 12). 

CCAGCTVTTAGATGAGTTGACAAAAA^ 

TT C TG CTACAACTATGACAT C CATTTT CT CC CACTTCAGACAGGATGAATACAAGGTGG CAAAGTGACAAGTG C CAAAAC 
TCAGGCGTGACTTTCCTGAAAACATCAGCATTCTGCCATATCTGGAATAA TGGATGTAAAGGACCGGCGACACCGCTCTT 
TGACCAGAGGACGCTGTGGCAAAGAGTGTCGCTACACAAGCTCCTCTCTGGACAGTGAGGACTGCCGGGTGCCCACACAG 
AAATCCTACAGCTCCAGTGAGACTCTGAAGGCCTATGACCATGACAGCAGGATGCACTATGGAAACCGAGTCACAGACCT 
CATCCACCGGGAGTCAGATGAGTTTCCTAGACAAGGAACCAACTTCACCCTTGCCGAACTGGGCATCTGTGAGCCCTCCC 
CACACCGAAGCGGCTACTGCTCCGACATGGGGATCCTTCACCAGGGCTACTCCCTTAGCACAGGGTCTGACGCCGACTCC 
GAGAC CGAGGGAGGGATGTCT C CAGAACACGC CATCAGACTGTGGGG CAGAGGGATAAAAT C CAGGCG CAGTT C CGGC CT 
GTCCAGTCGTGAAAACTCGGCCCTTACCCTGACTGACTCTGACAACGAAAACAAATCAGATGATGAGAACGGTCGTCCCA 
TTCCAGCTACATCCTCGCCTAGTCTCCTCCCATCTGCTCAGCTGCCTAGCTCCCATAATCCTCCACCAGTTAGCTGCCAG 
ATGCCATTGCTAGACAGCAACACCTCCCATCAAATCATGGAC^CCAACCCTGATGAGGAATTCTCCCCCAATTC^TACCT 
GCTCAGAGCATGCTCAGGGCCCCAGCAAGCCTCCAGCAGTGGCCCTCCGAACCACCACAGCCAGTCGACTCTGAGGCCCC 
CTCTCCCACCCCCTC^C^CCACACGCTGTCCCATCACC^CTC 

CGGCGGAGTCAGATCCACGCCCCGGCCCCAGCGCCCAATGACCTGGCCACCACACCAGAGTCCGTTCAGCTTCAGGACAG 

CTGGGTGCTAAACAGCAACGTGCCACTGGAGACCCGGCACTTCCTCTTCAAGACCTCCTCGGGGAGCACACCCTTGTTCA 

GCAGCTCTTCCCCGGGATACCCTTTGACCTCAGGAACGGTTTACACGCCCCCGCCCCGCCTGCTGCCCAGGAATACTTTC 

TCCAGGAAGGCTTTGAAGCTGAAGAAGCCCTCCAAATACTGCAGCTGGAAATGTGCTGCCCTCTCCGpCATTGCCGCGGC 

CCTCCTCTTGGCTATTTTGCTGGCGTATTTCATAGTGCCCTGGTCGTTGAAAAACAGCAGCATAGACAGTGGTGAAGCAG 

AAGTTGGTCGGCGGGTAACACAAGAAGTCCCACGAGGGGTGTTTTGGAGGTCACAAATTCACATCAGTCAGCC 

TTAAAGTTCAACATCTCCCTCGGGAAGGACGCTCTCTTTGGTGTTTACATAAGAAGAGGACTTCCACCATCTCATGCCCA 

GTATGACTTCATGGAACGTCTGGACGGGAAGGAGAAGTGGAGTGTGGTTGAGTCTCCCAGGGAACGCCGGAGCATACAGA 

CCTTGGTTCAGAATGAAGCCGTGTTTGTGCAGTACCTGGATGTGGGCCTGTGGCATCTGGCCTTCTACAATGATGGAAAA 

GACAAAGAGATGGTTTCCTTCAATACTGTTGTCCTAGATTCAGTGCAGGACTGTCCACGTAACTGCCATGGGAATGGTGA 

ATGTGTGTCCGGGGTGTGTCACTGTTTCCCAGGATTTCTAGGAGCAGACTGTGCTAAAGCTGCCTGCCCTGTCCTGTGCA 

GTGGGAATGGACAATATTCTAAAGGGACGTGCCAGTGC TACAGCGGCTGGAAAGGTGCAGAG TG CGACGTGC C CATGAAT 

CAGTGCATCGATCCTTCCTGCGGGGGGCACGGCTCCTGCATTGATGGGAACTGTGTCTGCTCTGCTGGCTACAAAGGCGA 

GCACTGTGAGGAAGTTGATTGCTTGGATCCCACCTGCTCCAGCCACGGAGTCTGTGTGAATGGAGAATGCCTGTGCAGCC 

CTGGCTGGGGTGGTCTGAACTGTGAGCTGGCGAGGGTCCAGTGCCCAGACCAGTGCAGTGGGCATGGCACGTACCTGCCT 

GACACGGGCCTCTGCAGCTGCGATCCCAACTGGATGGGTCCCGACTGCTCTGTTGAAGTGTGCTCAGTAGACTGTGGCAC 

TCACGGCGTCTGCATCGGGGGAGCCTGCCGCTGTGAAGAGGGCTGGACAGGCGCAGCGTGTGACCAGCGCGTGTGCCACC 

CCCGCTGCATTGAGCATGGGACCTGTAAAGATGGCAAATGTGAATGCCGAGAGGGCTGGAATGGTGAACACTGCACCATT 

GATGGCTGCCCTGACTTGTGC^y^CGGTAACGGGAGATGCACACTGGGTCAGAACAGCTGGCAGTGTGTCTGCCAGACCGG 

CTGGAGAGGGCCCGGATGCAACGTTGCCATGGAAACTTCCTGTGCTGATAACAAGGATAATGAGGGAGATGGCCTGGTGG 

ATTGTTTGGACCCTGACTGCTGCCTGCAGTCAGCCTGTCAGAACAGCCTGCTCTGCCGGGGGTCCCGGGACCCACTGGAC 

ATCATTCAGCAGGGCCAGACGGATTGGCCCGCAGTGAAGTCCTTCTATGACCGTATCAAGCTCTTGGCAGGCAAGGATAG 

CACCCACATGATTCCTGGAGAGAACCCTTTCAACAGCAGCTTGGTTTCTCTCATCCGAGGCCAAGTAGTAACTACAGATG 

GAACTCCCCTGGTCGGTGTGAACGTGTCTTTTGTCAAGTACCCAAAATACGGCTACACCATCACCCGCCAGGATGGCACG 

TTCGACCTGATCGCAAATGGAGGTGCTTCCTTGACTCTACACTTTGAGCGAGCCCCGTTCATGAGCCAGGAGCGCACTGT 

GTGGCTGCCGTGGAACAGCTTTTACGCCATGGACACCCTGGTGATGAAGACCGAGGAGAACTCCATCCCCAGCTGTGACC 

TCAGTGGCTTTGTCCGGCCTGATCCAATCATCATCTCCTCCCCACTGTCCACCTTCTTTAGTGCTGCCCCTGGGCAGAAT 

CCCATCGTGCCTGAGACCCAGGTTCTTCATGAAGAAATCGAGCTCCCTGGTTCCAATGTGAAACTTCGCTATCTGAGCTC 

TAGAACTGCAGGGTACAAGTCACTGCTGAAGATCACCATGACCCAGTCCACAGTGCCCCTGAACCTCATTAGGGT 

TGATGGTGGCTGTCGAGGGGCATCTCTTCCAGAAGTCATTCCAGGCTTCTCCCAACCTGGCCTCCACCTTCATCTGGGAC 

AAGACAGATGCGTATGGCCAAAGGGTGTATGGACTCTCAGATGCTGTTGTGTCTGTCGGGTTTGAATATGAGACCTGTCC 

CAGTCTAATTCTCTGGGAGAAAAGGACAGCCCTCCTTCAGGGATTCGAGCTGGACCCCTCCAACCTCGGTGGCTGGTCCC 

TAGACAAACACGACATCCTCAATGTTAAAAGTGGAATCCTACACAAAGGCACTGGGGAAAACCAGTTCCTG 

CCTGCCATCATCACCAGCATCATGGGCAATGGTCGCCGCCGGAGCATTTCCTGTCCCAGCTGCAACGGCCTTGCTGAAGG 

CAACAAGCTGCTGGCCCCAGTGGCTCTGGCTGTTGGAATCGATGGGAGCCTCTATGTGGGTGACTTCAATTACATCCGAC 

GCATCTTTCCCTCTCGAAATGTGACCAGCATCTTGGAGTTACGAAATAAAGAGTTTAAACATAGCAACAACCCAGCACAC 

AAGTACTACTTGGCAGTGGACCCCGTGTCCGGCTCGCTCTACGTGTCCGACACCAACAGCAGGAGAATCTACCGCGTCAA 

GTCTCTGAGTGGAACCAAAGACCTGGCTGGGAATTCGGAAGTTGTGGCAGGGACGGGAGAGCAGTGTCTACCCTTTGATG 

AAGCCCGCTGCGGGGATGGAGGGAAGGCCATAGATGCAACCCTGATGAGCCCGAGAGGTATTGCAGTAGACAAGAATGGG 

CTCATGTACTTTGTCGATGCCACCATGATCCGGAAGGTTGACCAGAATGGAATCATCTCCACCCTGCTGGGCTCCAATGA 

CCTCACTGCCGTCCGGCCGCTGAGCTGTGATTCCAGCATGGATGTAGCCCAGGTTCGTCTGGAGTGGCCAACAGACCTTG 

CTGTCAATCCCATGGATAACTCCTTGTATGTTCTAGAGAACAATGTCATCCTTCGAATCACCGAGAACCACCAAGTCAGC 
ATCATTGCGGGACGCCCCATGCACTGCCAAGTTCCTGGCATTC 

GGAGTCAGCCAGTGCCATTGCCATTTCTCACACTGGGGTCCTCTACATCACTGAGACAGATGAGAAGAAGATTAACCGTC 
TACGCCAGGTAACAACCAACGGGGAGATCTGCCTTTTAGCTGGGGCAGCCTCGGACTGCGACTGCAAAAACGATGTCAAT 
TGCAACTGCTATTCAGGAGATGATGCCTACGCGACTGATGCCATCTTGAATTCCCCATCATCCTTAGCTGTAGCTCCAGA 
TGGTACCATTTACATTGCAGACCTTGGAAATATTCG 

ACCAGTATGAGGCTGCATCCCCCGGAGAGCAGGAGTTATATGTTTTCAACGCTGATGGCATCCACCAATACACTGTGAGC 

CTGGTGACAGGGGAGTACTTGTACAATTTC&CATATAGTACT^ 

TTCCCTGAAGATCCGTCGGGACAGCAGTGGCATGCCCCGTCACCTC 

TGGGCACCAATGGAGGCCTCAAAGTCGTGTCCAC^CAGAACCTGGAGCTTGGTCTCATGACCTATGATGGCAACACTGGG 
CTCCTGGCCACCAAGAGCGATGAAACAGGATGGACGACTTTCTATGACTATGACCACGAAGGCCGCCTGACCAACGTGAC 
GCGCCCCACGGGGGTGGTAACCAGTCTGCACCGGGAAATGGAGAAATCTATTACCATTGACATTGAGAACTCCAACCGTG 
ATGATGACGTCACTGTCATCACCAACCTCTCTTCAGTAGAGGCCTCCTACACAGTGGTACMGATCAAGTTCGGAACAGC 
TACCAGCTCTGTAATAATGGTACCCTGAGGGTGATGTATGCTAATGGGATGGGTATCAGCTTCCACAGCGAGCCCCATGT 
CCTAGCGGGCACCATCACCCCCACCATTGGACGCTGCAACATCTCCCTGCCTATGGAGAATGGCTTAAACTCCATTGAGT 
GGCGCCTAAGAA^GGAACAGATTAAAGGCAAAGTCACCATCTTTQGCAGGAAGCTCCGGGTCCATGGAAGAAATCTCTTG 
TCCATTGACTATGATCGAAA.TATTCGGACTGAAAAGATCTATGATGACCACCGGAAGTTCACCCTGAGGATCATTTATGA 
CCAGGTGGGCCGCCCCTTCCTCTGGCTGCCCAGCAGCGGGCTGGCAGCTGTCAACGTGTCATACTTCTTCAATGGGCGCC 
TGGCTGGGCTTCAGCGTGGGGCCATGAGCGAGAGGACAGACATCGACAAGCAAGGCCGCATCGTGTCCCGCATGTTCGCT 
GACGGGAAAGTGTGGAGCTACTCCTACCTTGACAAGTCCATGGTCCTCCTGCTTCAGAGCCAACGTCAGTATATATTTGA 
GTATGACTCCTCTGACCGCCTCCTTGCCGTCACCATGCCCAGCGTGGCCCGGCACAGCATGTCCACACACACCTCCATCG 
GCTACATCCGTAATATTTACAACCCGCCTGAAAGCAATGCTTCGGTC^TCTTTGACTACAGTGATGACGGCCGCATCCTG 
AAGACCTCCTTTTTGGGC^CCGGACGCCAGGTGTTCTACAAGTATGGGAAACTCTCCAAGTTATCAGAGATTGTCTACGA 
CAGTACCGCCGTCACCTTCGGGTATGACGAGACCACTGGTGTCTTGAAGATGGTCAACCTCCAAAGTGGGGGCTTCTCCT 
GCACCATCAGGTACCGGAAGATTGGCCCCCTGGTGGACAAGCAGATCTACAGGTTCTCCGAGGAAGGCATGGTCAATGCC 
AGGTTTGACTACACCTATCATGACAAC^GCTTCCGCATCGCAAGCATC^AGCCCGTCATAAGTGAGACTCCCCTCCCCGT 
TGACCTCTACCGCTATGATGAGATTTCTGGCAAGGTGGAACACTTTGGTAAGTTTGGAGTCATCTATTATGACATCAACC 
AGATCATCACCACTGCCGTGATGACCCTCAGCAAACACTTCGAC^ 

TTCCGGTCCCTCATGTACTGGATGACGGTGCAATATGACAGCATGGGCAGGGTGATCAAGAGGGAGCTAAAACTGGGGCC 
CTATGCCAATACCACGAAGTACACCTATGACTACGATGGGGACGGGCAGCTCCAGAGCGTGGCCGTCAATGACCGCCCGA 
CCTGGCGCTACAGCTATGACCTTAATGGGAATCTCCACTTACTGAACCCAGGCAACAGTGTGCGCCTCATGCCCTTGCGC 
TATGACCTCCGGGATCGGATAACCAGACTCGGGGATGTGCAGTACAAAATTGACGACGATGGCTATCTGTGCCAGAGAGG 
GT CTGACATCTT CGAATACAATT C CAAGGG CCT C CTAACAAGAGC CTACAACAAGG C CAGCGGGTGGAGTGTCCAGTACC 
GCTATGATGGCGTAGGACGGCGGGCTTCCTACAAGACCAACCTGGGCCACCACCTGCAGTACTTCTACTCTGACCTCCAC 
AACCCGACGCGCATCACCCATGTCTACAATCACTCCAACTCGGAGATTACCTCACTGTACTACGACCTCCAGGGCCACCT 
CTTTGCCATGGAGAGCAGCAGTGGGGAGGAGTACTATGTTGCCTCTGATAACACAGGGACTCCTCTGGCTGTGTTCAGCA 
TCAACGGCCTCATGATCAAACAGCTGCAGTACACGGCCTATGGGGAGATTTATTATGACTCCAACCCCGACTTCCAGATG 
GTCATTGGCTTCCATGGGGGACTCTATGACCCCCTGACCAAGCTGGTCCACTTCACTCAGCGTGATTATGATGTGCTGGC 
AGGACGATGGACCTCCCCAGACTATACCATGTGGAAAAACGTGGGCAAGGAGCCGGCCCCCTTTAACCTGTATATGTTCA 
AGAGCAACAAT C C TC T CAGCAGTGAG CTAGATTTGAAGAACTACGTGACAGATGTGAAAAGCTGGCTTGTGATGTTTGGA 
TTTCAGCTTAGCAACATCATTCCTGGCTTCCCGAGAGCCAAAATGTATTTCGTGCCTCCTCCCTATGAATTGTCAGAGAG 
TCAAGCAAGTGAGAATGGACAGCTCATTACAGGTGTCCAACAGACAACAGAGAGACATAACCAGGCCTTCATGGCTCTGG 
AAGGACAGGTCATTACTAAAAAGCTCCACGCCAGCATC^ 

ATTGGCAAAGGCATCATGTTTGCCATCAAAGAAGGGCGGGTGACCACGGGCGTGTCCAGCAT 
CAAGGTGGCATCTGTGCTGAACAACGCCTACTACCTGGACAAGATC 

TTGTGAAGATTGGCTCAGCCGATGGGGACCTGGTCACACTAGGCACCACCATCGGCCGCAAGGTGCTAGAGAGCGGGGTG 
AACGTGACCGTGTCCCAGCCCACGCTGCTGGTCAACGGCAGGACTCGAAGGTTCACGAACATTGAGTTCCAGTACTCCAC 
GCTGCTGCTCAGCATCCGCTATGGCCTCACCCCCGACACCCTGGACGAAGAGAAGGCCCGCGTCCTGGACCAGGCGAGAC 
AGAGGGCCCTGGGCACGGCCTGGGCCAAGGAGCAGCAGAAAGCCAGGGACGGGAGAGAGGGGAGCCGCCTGTGGACTGAG 
GG CGAGAAGCAG CAG CTT CTGAGCACCGGG CGCGTG CAAGGG TACGAGGGATATTACGTG CTT C C CGTGGAGCAATACCC 
AGAGCTTGCAGACAGTAGCAGCAACATCCAGTTTTTAAGACAGAATGAGATGGGAAAGAGGTA ACAAAATAATCTGCTGC 
CATTCCTTGTCTGAATGGCTCAGCAGGAGTAACTGTTATCTCCTCTCGTAAGGAGATGAAGACCTAACAGGGGCACTGCG 
G C TGGG C TGCTTTAGGAGAC CAAGTGG CAAGAAAG CTCACAT TT TT TGAGTT CAAATGC TACTGTC CAAG CGAGAAG T CC 
CTCATCCTGAAGTAGACTAAAGCCCGGCTGAAAATTCCGAGGAAAACAAAACAAACGAATGAATGAACAGACACACACAA 
TGTTCCAAGTTCCCCTAAAATATGACCCACTTGTTCTGGGTCTACGCAGAAAAGAGACGCAAAGTGTCCAAAAGGAACAA 
AAGAACAAAAACGAATAAGCAAAGAAGAAAACAAAC^ 

GAAGATAAGAAAGAAGGCCTCATATCCAATTACCTCACTCATTCACATGTGAGCGACACGCAGACATCCGCGAGGGCCAG 
CGTCACCAGACCAGCTGCGGGACAAACCACTCAGACTGCTTGTAGGACAAATACTTCTGACATTTTCGTTTAAGCAAATA 
CAGGTGCATTTAAAACACGACTTTGGGGGTGATTTGTGTGTAGCGCCTGGGGAGGGGGGATAAAAGAGGAGGAGTGAGCA 
CTGGAAATACTTTTTAAAGAAAAAAAAACATGAGGGAATAAAAGA 

CCAGCACTTAACTCTCAGGTCCCAACTAAGTCTGGCCTGAGCTAATTTATTTGAGCGCAGAGTGTAAAATTTAATTCAAA 
ATGGTGG CTATAATCACTACAGATAAATTT CATACTCTTTTGTCTT TGGAGATT C CATTGTGGACAGTAATACGCAG TTA 
CAGGGTGTAGTCTGTTTAGATTCCGTAGTTCGTGGGTATCAGTTTCGGTAGAGGTGCAGCATCGTGACACTTTTGCTAAC 
AGGTACCACTTCTGATCACCCTGTACATACATGAGCCGAAAGGCACAA^ 
TTGTTTGGTCCAGAAACTGAGACAATCACATGACAGTCACCACGAGGAGA 

AAAAATTTTAAAAATTAAAAAAACAAAAATAAAGTCTAATAAGAACTTTGGTACAGGAACTTTTTTGTAATAT^ 

TGAATTGTTCATCGAGTTTTTATATTAATTTTAATTTGCTGCTAAGCAAAGACTAGGGACAGGCAAAGATAATTTATGGC 

AAAGTGTTTAAATTGTTTATACATAAATAAAGTCTCTAAAACTCCTGTG 

The FCTR3f polypeptide (SEQ ID NO:13) encoded by SEQ ID NO:12 is 2724 amino 
acid residues long and is presented using the one-letter code in Table 3I. This sequence differs 
from FCTR3b in that it lacks amino acids 758-766 of that polypeptide. 
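
The relationship between the two polypeptide lengths can be checked with a brief sketch in Python. It assumes 1-based inclusive residue numbering and uses a placeholder string of the stated FCTR3b length rather than the actual sequence; the function name `delete_residues` is illustrative only.

```python
def delete_residues(protein: str, first: int, last: int) -> str:
    """Remove residues first..last (1-based, inclusive) from a protein sequence."""
    if not 1 <= first <= last <= len(protein):
        raise ValueError("residue range must lie within the protein")
    return protein[: first - 1] + protein[last:]


# Placeholder standing in for the 2733-residue FCTR3b polypeptide.
placeholder = "X" * 2733
shortened = delete_residues(placeholder, 758, 766)
print(len(shortened))  # 2724
```

Removing the nine residues 758-766 from a 2733-residue polypeptide leaves 2724 residues, which agrees with the length stated for FCTR3f.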

Table 3I. Encoded FCTR3f protein sequence (SEQ ID NO:13). 

5 MD VKDRRHRS LTRGRCGKE CR YT S S S LD S E DCRVPTQ KS YS S S E TLKAYDHD S RMH YGNRVTDL I HRE S DE F P RQGTNFT LAE LG I 
CEPSPHRSGYCSDMGILHQGYSLSTGSDADSDTEGGMSPEHAIRLWGRGIKSRRSSGLSSRENSALTLTDSDNENKSDDENGRPIP 
PTSSPSLLPSAQLPSSHNPPPVSCQMPLLDSNTSHQIMDTNPDEEFSPNSYLLRACSGPQQASSSGPPNHHSQSTLRPPLPPPHNH 
TLSHHHSSANSLNRNSLTNRRSQIHAPAPAPNDLATTPESVQLQDSWVLNSNVPLETRHFLFKTSSG 
YTPPPRLLPRNTFSRKAFKLKKPSKYCSWKCAALSAIAAALLLAILLAYFIVPWSLKNSSIDSGEAEV 
10 HI SQPQFLKFNI S LGKDALFGVY IRRGLPPSHAQYDFMERLDGKE KWS WE SPRERRS I QTLVQNEAVFVQYLDVGLWHLAFYNDG 
KDKEMVSFNTWLDSVQDCPRNCHGNGE 

SCGGHGSCIDGNCVCSAGYKGEHCEEVDCLDPTCSSHGVCVNGECLCSPGWGGLNCELARVQCPDQCSGHGTYLPDTGLCSCDPNW 
MGPDCSVEVCSVDCGTHGVCIGGACRCEEGWTGAACDQRVCHPRCIEHGTCKDGKCECREGWNGEHCTIDGCPDLCNGNGRCTLGQ 
NSWQCVCQTGWRGPGCWAMETSCADNKDNEGDGLVDCLDPDCCLQSACQNSLLCRGSRDPLDIIQQGQTDWPAVKSFYDRIKLLA 
1 5 GKDSTH 1 1 PGENPFNSSLVSLIRGQWTTDGTPLVGVNVSFVKYPKYGYT I TRQDGTFDLIANGGASLTLHFERAPFMSQERTWL 
PWNSFYAMDTLVMKTEENSIPSCDLSGFVRPDPIIISSPLSTFFSAAPGQNPIVPETQVLHEEIELPGSNVKLRYLSSRTAGYKSL 
LKITMTQSTVPLNLIRVHLMVAVEGHLFQKSFQASPNLASTFIWDKTDAYGQRWGL 

FELDPSNLGGWSLDKHHILNVKSGILHKGTGENQFLrTQQPAI ITS IMGNGRRRS ISCPSCNGLAEGNKLLAPVALAVGIDGSLYVG 
DFNYIRRIFPSRNVTSILELRNKEFKHSNNPAHKYYLAVDPV^ 
20 EARCGDGGKAIDATLMSPRGIAVDKNGLMYFVDATMIRKVDQNGIISTLLGSNDLTAWPLSC^ 
NSLWLENOTILRITENHQVSI IAGRPMHCQVP^ 

LAGAASDCDCKNDVNCNC YSGDDAYATDAI LNS P S SLAVAPDGT I YI ADLGNI R I RAVS KNKP VLNAFNQ YEAAS PGEQEL YVFNA 
DGIHQYTVSLVTGEYLYNFTYSTDNDVTELIDNNGNSLKIRRDSSGMPRHLLMPDNQIITLW^ 

NTGLLATKSDETGWTTFYDYDHEGRLT1WTRPTGWTSLHREMEKSITIDIENSNRDDDVTVITNLSSVEASYTWQDQVRNSY 
25 CNNGTLRVMYANGMGISFHSEPHVIAGTITPTIGRCNISLPMENGLNSIEWRLRKEQIKGKVTIF 
TEKIYDDHRKFTLRIIYDQVGRPFLWLPSSGLAAVNVSYF 

LLLQSQRQYIFEYDSSDRLLAVTMPSVARHSMSTHTSIGYIRNIYNPPESNASVIFDYSDDGRILKTSFLGTGRQVFYKYGKLSKL 
SEIWDSTAVTFGYDETTGVLKMWLQSGGFSCTIRYRKIGPLVDKQIYRFSEEGiyiVNARFDYTYHDNSFRIASIKPVISETPLPV 
DLYRYDEISGKVEHFGKFGVIYYDINQIITTAVMTLSKHFDTHGRIKEVQYEMFRSLMYWMTVQYDSMGRVIKRELKLGPYANTTK 

30 YTYDYDGDGQLQSVAVNDRPTWRYSYDLNGNLHLLNPGNSVRLMPLRYDLRDRITRLGDVQYKIDDDGYLCQRGSDIFEYNSKGLL 
TRAYNKASGWSVQYRYDGVGRRASYKTNLGHHLQYFYSDLHNPTRITHVYlSrHSNSEITSLYYDLQGHLFAMESSSG 
GTPLAVFSINGLMIKQLQYTAYGEIYYDSNPDFQMVIGFHGGLYDPLTKLVHFTQRDYDVIAGRWTSPDYTMWKOT 
MFKSNNPLSSELDLKNYVTDVKSWLVMFGFQLSNIIPGFPRAKMYFVPPPYELSESQASENGQLITGVQQTTERHNQAFM^ 
I TKKLHAS I RE KAGHWFATTTP I IGKG IMFAI KEGRVTTGVS S I ASEDSRKVAS VLNNAYYLDKMHYS I EGKDTHYFVKIGSADGD 

35 LVTLGTTIGRKVLESGVNVTVSQPTLLWGRTRRFTNIEFQYSTLLLSIRYGLTPDTLDEEKARVLDQARQRALGTAWAKEQQKAR 
DGREGS RL WTEGE KQQLLS TGR VQG YEGY YVLP VEQ YPELADS SSN I Q FLRQN EMGKR 



In a BLASTN search it was found that the FCTR3a nucleic acid has homology to three 
fragments of Mus musculus odd Oz/ten-m homolog 2. It has 634 of 685 bases (92%) identical to 
bases 614-1298, 365 of 406 bases (89%) identical to bases 1420-1825, and 93 of 103 bases 
(90%) identical to bases 1823-1925 of Mus musculus odd Oz/ten-m homolog 2 (GenBank Acc: 
NM_011856.2) (Table 3J). 
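
The identity figures quoted above can be reproduced with a short sketch in Python. The helper names are illustrative, and the use of integer truncation is an assumption: it is chosen here because it is consistent with all three reported values (for example, 365/406 is approximately 89.9% and is reported as 89%). The two 25-base strings are the final row of the first alignment in Table 3J.

```python
def count_identities(query: str, subject: str) -> int:
    """Count aligned positions at which query and subject carry the same base."""
    if len(query) != len(subject):
        raise ValueError("aligned rows must have equal length")
    return sum(1 for q, s in zip(query, subject) if q == s and q != "-")


def percent_identity(identities: int, aligned_length: int) -> int:
    """Integer percent identity, truncated rather than rounded (assumption)."""
    return identities * 100 // aligned_length


# Final row of the first alignment (query 774-798, subject 1274-1298).
print(count_identities("GCTATTTTGCTGGCGTATTTCATAG",
                       "GCCATTTTGCTGGCATATTTCATAG"))  # 23

# The three fragments reported in the text.
for identities, length in [(634, 685), (365, 406), (93, 103)]:
    print(identities, length, percent_identity(identities, length))
# 634 685 92
# 365 406 89
# 93 103 90
```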

Table 3J. BLASTN of FCTR3a against Mus musculus odd Oz/ten-m homolog 2 (SEQ ID NO:62) 

>GI|7657414|REF|NM_011856.2| MUS MUSCULUS ODD OZ/TEN-M HOMOLOG 2 (DROSOPHILA) (ODZ2), MRNA 
LENGTH = 8797 

SCORE = 954 BITS (481), EXPECT = 0.0 
IDENTITIES = 634/685 (92%) 
STRAND = PLUS / PLUS 

QUERY: 114 GGTCGTCCCATTCCACCTACATCCTCGCCTAGTCTCCTCCCATCTGCTCAGCTGCCTAGC 173 
           ||||||||||||||||||||||||||| |||| |||||||||||||||||||||||||||
SBJCT: 614 GGTCGTCCCATTCCACCTACATCCTCGTCTAGCCTCCTCCCATCTGCTCAGCTGCCTAGC 673 





QUERY: 174 TCCCATAATCCTCCACCAGTTAGCTGCCAGATGCCATTGCTAGACAGCAACACCTCCCAT 233 
           ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
SBJCT: 674 TCCCATAATCCTCCACCAGTTAGCTGCCAGATGCCATTGCTAGACAGCAACACCTCCCAT 733 


QUERY: 234 CAAATCATGGACACCAACCCTGATGAGGAATTCTCCCCCAATTCATACCTGCTCAGAGCA 293 
           || |||||||||||||||||||||||||||||||||||||||||||||||||||||||||
SBJCT: 734 CAGATCATGGACACCAACCCTGATGAGGAATTCTCCCCCAATTCATACCTGCTCAGAGCA 793 


nTTT?PV - 


0 QA 


tpptpapppppppagpaagpptppagpagtggccctccgaaccaccacagccagtcgact 


353 


<5R.TPT * 
OxjU X . 


7 QA 


IIIIIIIIIII1IMIIIIIIIIIIIIIIIIIIIIIII IIIIIIIIIIIIIMII II 

TGCTCAGGGCCCCJ^GCAAGCCTCCAGCAGTGGCCCTCCAA^ 


853 




1 CA 


PTPAPPPPPPPTPTPPPAPPPPPTPAPAAPPAPAPGPTGTPPPATPAPPAPTPPrTCCGCC 


413 






IIIMIIIMilll IMIIIIIIII IIIIIIM llllllll lllillll II III 

PTP APPPPPPPTPTPPP APPPPPTP AT A APPAPAPPPTPTPPPAPPAPPAPTPPTPGGPC 


913 


PT7TP"D V • 




AAPTPPPTPAAPAPPAAPTPAPTPAPPAATPPtPtPPtRAGTPAGATPPACGCCCCGGCCCCA 


473 


CD TPT . 


Q"l >t 

y x^t 


IIIMMIIIIIMMIIIIIIMMIMMIIIIIIMM IMIIIIIIII 11 II 

AACTCCCTCAACAGGAACTC^CTGACCAATCGGCGGAGT 


973 


ATTPPV • 


*± / *± 


PtPPt CC P A ATPrAPPTRPr PPAP CACACCAGAGTCCGTT CAG CTT CAGGACAGCTGGGTGCTA 

IIIMIII llllllllllllll llllllll llllllll Mill IMIIIIIIII 

P PP P P P A A PP, A PPTPrP, P P A P P A P P P P APrAGT PTGTTPAGPT P CAGGATAG CTGGGTGC TG 


533 


bDdtl : 


G7A 
2? /ft 


1033 






A A PAP, PA A PPTPrPP A PTPP AP APPPPPrP APTTPPTPTTP AAGAPPTCCTCGGGGAGCACA 


593 


CD TPT 1 . 




Mill Mill IIIIIMIIII MIIMIMM Mill II II II II Mill! 

AAPAPTAAPPTPPPAPTPPAPrAPTPnPtPAPTTPPTTTTPAAAAPGTPGTPTGGAAGCACA 


1093 


ATT17DV . 


C G /t 


X X\J X X ^HuLriOL X v, X X LuLLuuUriiriv. | wU XXX VJi J ri\-.\— X Wlwwln.\->OU X X X riv^nv-uv-u^ 


653 


pn T/" 1 T' . 

b-bJCx : 


X Ui?4 


Ml IIIIIIIMIIIIMI Mil II'MMHII MMMII II Mill II II 

PPPPfPTTPAPPaPPTPTTPTPPPPP AT APPPTTTPAPPTPAPrPrPAPPPrTTT AT APAPP A 


1153 


yUiVKi : 


£ C A 


PPP PPPPPPPTPPTPPPP APP A AT APTTTPTPPAPP A APPPTTTP A APPTPAAGAAGPPP 


713 




1 1 c: A 


II MIIMIIIIIMI lllllll IIIIIMMIMM IMIIIIIIIIIII Ml 

PPAPPPPPPPTPPTPPPAPPPAATAPATTPTCPAGGAAGGCCTTCAAGCTGAAGAAACCC 


1213 




714 


TCCAAATACTGCAGCTGGAAATGTGCTGCCCTCTCCGCCATTGCCGCGGCCCTCCTCTTG 


773 




Xj£ -Lft 


IIIIIIIIIIMM IMIIMIIMMMM II Mill Mill IIIIMMIMI 

TPPAAATAPTPPAPTTPPAAATPTPPTPPPPTGTPTGPPATPGCPGCCGCCCTCCTCTTG 


1273 


QUERY: 


774 


GCTATTTTGCTGGCGTATTTCATAG 798 




SB JCT : 


1274 


II IIIIMMIM MINIUM 

GCCATTTTGCTGGCATATTTCATAG 1298 






SCORE = 480 BITS (242), EXPECT = E-132
IDENTITIES = 365/406 (89%)
STRAND = PLUS / PLUS

QUERY: 797 AGTGCCCTGGTCGTTGAAAAACAGCAGCATAGACAGTGGTGAAGCAGAAGTTGGTCGGCG 856

50 | M 1 1 1 1 II M I II I II 1 1 1 II I II I II I II 1 1 1 II I II M I II II I II II 1 1 1 1 1 II 

jTGC C CTGG T CAT TGAAAAACAG CAG CATAGACAGTGG CGAAG CAGAAGT TGGT CGGCG 1479 
^TAACACAAGAAGTCCaCCAGGGGTGTTTTGGAGGTCACAAATTCACATCAGTCAGCC 916 

II Mill IMMMMMMIMMMMMIMM II II 1 1 1 i 1 1 1 1 1 1 1 1 1 II 

jTGACACAGGAAGT CC CACCAGGGGTGTTTTGGAGGTCCCAGATTCACATCAGTCAGCC 153 9 
^AGTTCTTAAAGTTCAACATCTCCCTCGGGAAGGACGCTCTCTTTGGTGTTTACATAAG 976 

II IMIIMMMMMMIIMM 1 1 Mill II Mill INN II Mill 

^AATTCTTAAAGTTCAACATCTCCCTGGGCAAGGATGCCCTCTTCGGTGTCTATATAAG 1599 
^GAGGACTTCCACCATCTCATGCCCAGTATGACTTCATGGAACGTCTGGACGGGAAGGA 103 6 

MMMII Mill IMMIIMIIIMIIMIIIMMIIM Mill M Mill 

^GAGGACTACCACCGTCTCATGCCCAGTATGACTTCATGGAACGCCTGGATGGAAAGGA 1659 
^GTGGAGTGTGGTTGAGTCTCCCAGGGAACGCCGGAGCATACAGACCTTGGTTCAGAA 1096 

II Mill Mill Mill t ! 1 M 1 1 M 1 1 1 i 1 1! 1 1 1 [ Mill MM Mill 

JVAATGGAGCGTGGTCGAGTCGCCCAGGGAACGCCGGAGCATCCAGACTCTGGTGCAGAA 1719 
3AAGCCGTGTTTGTGCAGTACCTGGATGTGGGCCTGTGCCATCTGGCCTTCTACAATGA 1156 

70 || M | HI HI | HI MM || MMMII III I III M I III till Ml I III 1 1 




QUERY : 


797 


SB JCT : 


1420 


QUERY: 


857 


SB JCT : 


1480 


QUERY: 


917 


SB JCT : 


1540 


QUERY: 


977 


SB JCT : 


1600 


QUERY : 


1037 


SB JCT : 


1660 


QUERY: 


1097 



SBJCT: 1720 CGAGGCTGTGTTTGTGCAGTACTTGGATGTGGGCCTGTGGCACCTGGCCTTCTACAATGA 1779 
QUERY: 1157 TGGAAAAGACAAAGAGATGGTTTCCTTCAATACTGTTGTCCTAGAT 1202
             || || ||||| |||||||| |||||||| ||||||||| |||||
SBJCT: 1780 CGGCAAGGACAAGGAGATGGTCTCCTTCAACACTGTTGTCTTAGAT 1825



SCORE = 125 BITS (63), EXPECT = 7E-26
IDENTITIES = 93/103 (90%)
STRAND = PLUS / PLUS



QUERY: 1258 GATTCAGTGCAGGACTGTCCACGTAACTGCCATGGGAATGGTGAATGTGTGTCCGGGGTG 1317
            ||||||||||||||||||||||| ||||| || ||||| |||||||| ||||| ||  ||
SBJCT: 1823 GATTCAGTGCAGGACTGTCCACGGAACTGTCACGGGAACGGTGAATGCGTGTCTGGACTG 1882

QUERY: 1318 TGTCACTGTTTCCCAGGATTTCTAGGAGCAGACTGTGCTAAAG 1360
            |||||||||||||||||||| ||||| ||||||||||||||||
SBJCT: 1883 TGTCACTGTTTCCCAGGATTCCTAGGTGCAGACTGTGCTAAAG 1925





In another BLASTN search it was found that the FCTR3a nucleic acid has homology to
three fragments of Gallus gallus mRNA for teneurin-2. It has 541 of 629 bases (86%) identical
to bases 502-1130, 302 of 367 bases (82%) identical to bases 1330-1696, and 87 of 103 bases
(84%) identical to bases 1711-1813 of Gallus gallus mRNA for teneurin-2 (EMBL Acc:
AJ245711.1) (Table 3K).
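
The percentages quoted above can be checked against the match counts. The following is a minimal Python sketch, not part of the original search, assuming that the reported percentage is the fraction of identical positions truncated to a whole number (for the three figures quoted here, rounding would give the same values). The function name `percent_identity` is hypothetical and is used for illustration only.

```python
def percent_identity(matches: int, aligned: int) -> int:
    """Return identical positions as a whole-number percentage (truncated)."""
    if aligned <= 0 or not 0 <= matches <= aligned:
        raise ValueError("need aligned > 0 and 0 <= matches <= aligned")
    return (100 * matches) // aligned


# (identical bases, aligned bases) for the three teneurin-2 fragments quoted above
fragments = [(541, 629), (302, 367), (87, 103)]
percentages = [percent_identity(m, n) for m, n in fragments]
print(percentages)
```

Under the stated assumption, the computed values agree with the 86%, 82%, and 84% figures given in the text.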

Table 3K. BLASTN of FCTR3a against Gallus gallus mRNA for teneurin-2 (SEQ ID NO:63)

>GI|6010048|EMB|AJ245711.1|GGA245711 GALLUS GALLUS MRNA FOR TENEURIN-2, SHORT SPLICE
VARIANT (TEN2 GENE)

LENGTH = 2496 



SCORE = 549 BITS (211), EXPECT = E-153
IDENTITIES = 541/629 (86%)
STRAND = PLUS / PLUS



QUERY: 114 GGTCGTCCCATTCCACCTACATCCTCGCCTAGTCTCCTCCCATCTGCTCAGCTGCCTAGC 173
           ||||||||||||||||||||||||||| |||| || |||||||||||||||||||| ||
SBJCT: 502 GGTCGTCCCATTCCACCTACATCCTCGTCTAGCCTTCTCCCATCTGCTCAGCTGCCCAGT 561

QUERY: 174 TCCCATAATCCTCCACCAGTTAGCTGCCAGATGCCATTGCTAGACAGCAACACCTCCCAT 233
           || ||||||||||||||||||||||||||||||||||||||||||||||| || ||||||
SBJCT: 562 TCTCATAATCCTCCACCAGTTAGCTGCCAGATGCCATTGCTAGACAGCAATACGTCCCAT 621


QUERY: 


234 


CAAATCATGGACACGAACCCTGATGAGGAATTCTCCCCCAATTCATACCTGCTCAGAGCA 


293 


SBJCT : 


622 


1 1 1 1 1 1 1 } 1 1 1 1 1 1 1 1 1 him inn iiiii ii milium n mm 

CAAATCATGGACACCAATC CTGACGAGGAGTTCTCT CCTAATTCATACCTACTAAGAGCA 


681 


QUERY: 


294 


TGCTCAGGGCCCCAGCAAGCCTCCAGCAGTGGCCCTCCGAACCACCACAGCCAGTCGACT 

ii iiiimi iiiii ii iiiiiimiiiiii i iiiii iiimmii ii 

TGTTCAGGGCCACAGCAGGCATCCAGC^GTGGCCCTTCAAACC^^ 


353 


SBJCT : 


682 


741 


QUERY: 


354 


CTGAGGCCCCCTCTCCCACCCCCTCACAACCACACGCTGTCCCATCACCACTCGTCCGCC 


413 


SBJCT : 


742 


MMMM MMMM M MMMMMM MMMMMMI MMMM Ml 

CTGAGGCCACCTCTCCCCCCTCCTCACAACCACTCGCTGTCCCATCATCACTCGTCTGCC 


801 


QUERY : 


414 


AACTCCCTCAACAiGGAACTCACTGACCAATCGGCGGAGTCAGATCCACGCCCCGGCCCCA 


473 


SBJCT : 


802 


MMMMMMMMMM M MMI M II 1 IIIIIIIMM II II II 

AACTCCCTC^CAGGAACTCGCTCACCAACCGCCGCAACCAGATCCACGCGCCTGCTCCC 


861 


QUERY: 


474 


GCGCCC^TGACCTGGCCACCACACCAGAGTCCGTTCAGCTTCAGGACAGCTGGGTGCTA 


533 


SBJCT : 


862 


II llllllllllllll MM! II Mill II MMI MMMMMMIMM 

GCTCCCAATGACCTGGCGACCACGCCTGAGTCTGTGCAGCTGCAGGACAGCTGGGTGCTC 


921 






QUERY: 


534 


SBJCT : 


922 


QUERY : 


594 


SBJCT : 


982 


QUERY : 


654 


SBJCT : 


1042 


QUERY : 


714 


SBJCT : 


1102 



1 1 i I i 1 1 ! 4 4 E 1 1 IIIIIIIM 1 1 1 1 III I II Mill II II II I I 

^CAGCAACGTGCCGCTGGAGACCAGGCATTTCTTGTTTAAGACATCTTCTGGAACGAC 
CCTTGTTCAGCAGCTCTTCCCCGGGATACCCTTTGACCTCAGGAACGGTTTACACGCC 

I lllllll IIIIMMIII II Mill MMMIIIMM Mill II I 

CGCTGTTCAGTAGCTCTTCCCCTGGCTACCCACTGACCTCAGGAACAGTTTATACTCC 
CGCCCCGCCTGCTGCCCAGGAATACTTTCTCCAGGAAGGCTTTCAAGCTGAAGAAGC( 

I III I III I II II Mill II llllllll II M I ; I II I II i MM 

CTCCCAGGCTGTTACCTAGAAATACATTTTCC^GGAATGCATTCAAGCTGAAAAAGCt 
CCAAATACTGCAGCTGGAAATGTGCTGC 742 

15 Mill II II Mill IIIMIIIIII I 

SBJCT: 1102 TCCAAGTATTGTAGCTGGAAATGTGCTGC 1130 

SCORE = 212 BITS (107), EXPECT = 4E-52
IDENTITIES = 302/367 (82%)
STRAND = PLUS / PLUS




QUERY: 


819 


AGCAGCATAGACAGTGGTGAAGCAGAAGTTGGTCGGCGGGTAACACAAGAAGTCCCACCA 
1 M M M II I! U U 1 1 E i 1 1 1 1 M M II 1 I Ml II Mill M M II 

1 M II II 1 II 1 II 1 1 1 III 1 II M M 1 II M Ml II 1 II II II II II 

AGCAG CATAGATAGTGGAGAAACAGAAGTTGGC CG C AAGGTCAC C CAAGAGGTGCCC CCT 


878 


SBJCT : 


1330 


1389 


QUERY : 


879 


GGGGTGTTTTGGAGGTCAGAAATTCACATCAGTCAGCCCCAGTTCTTAAAGTTCAACATC 


938 


SBJCT : 


1390 


II MM! Ill MM 11 II II Mill Mill llllll 1 IMIMIMM 

GGAGTGTTCTGGCGGTCTCAGATCCATATCAGCCAGCCACAGTTCCTGAAGTTCAACATA 


1449 


QUERY: 


939 


TCCCTCGGGAAGGACGCTCTCTTTGGTGTTTACATAAGAAGAGGACTTCCACCATCTCAT 


998 


SBJCT : 


1450 


Mill llllllll Mill II llllllll IMMimilMI llllllll Ml 

TCCCTAGGGAAGGATGCTCTTTTCGGTGTTTATATAAGAAGAGGACTCCCACCATCACAT 


1509 


QUERY : 


999 


GCCCAGTATGACTTCATGGAACGTCTGGACGGGAAGGAGAAGTGGAGTGTGGTTGAGTCT 


1058 


SBJCT : 


1510 


II Ml III MIIMIMM MM Mill Mill IIIIMMIII M II 

GCACAGTATGATTTCATGGAACGCTTGGATGGGAAAGAGAAATGGAGTGTGGTGGAATCC 


1569 


QUERY : 


1059 


CCCAGGGAACGCCGGAGCATACAGACCTTGGTTCAGAATGAAGCCGTGTTTGTGCAGTAC 


1118 


SBJCT : 


1570 


II MIIMI 11 II II Mill 1 II M 1 1 M II 1 11 IMIMM MUM 

CCACGGGAACGGCGAAGTATTCAGACTCTTGTTCAGAATGAGGCTGTGTTTGTTCAGTAC 


1629 


QUERY : 


1119 


CTGGATGTGGGCCTGTGGCATCTGGCCTTCTACAATGATGGAAAAGACAAAGAGATGGTT 


1178 


SBJCT : 


1630 


iiiiiiiiii iiiMii mil ii milium ii iiMiiii mi 

TTGGATGTGGGT TTGTGG CAC C TGGCGT TT TACAATGATGGCAAGGACAAAGAAGTGGTC 


1689 


QUERY: 1179 TCCTTCA 1185
            |||||||
SBJCT: 1690 TCCTTCA 1696




SCORE = 77.8 BITS (39), EXPECT = 1E-11
IDENTITIES = 87/103 (84%)
STRAND = PLUS / PLUS




QUERY: 1258 GATTCAGTGCAGGACTGTCCACGTAACTGCCATGGGAATGGTGAATGTGTGTCCGGGGTG 1317
            ||||||||||| |||||||||||||| || ||||| ||||| || ||||| || || ||
SBJCT: 1711 GATTCAGTGCAAGACTGTCCACGTAATTGTCATGGCAATGGCGAGTGTGTTTCTGGTGTC 1770

QUERY: 1318 TGTCACTGTTTCCCAGGATTTCTAGGAGCAGACTGTGCTAAAG 1360
            || |||||||| || |||||||  |||||||| ||||||||||
SBJCT: 1771 TGCCACTGTTTTCCCGGATTTCATGGAGCAGATTGTGCTAAAG 1813





In this search it was also found that fragments of the FCTR3b, c, d, and e nucleic acids have
homology to three fragments of Homo sapiens mRNA for KIAA1127 protein. They have 5537 of
5538 bases (99%) identical to bases 1-5538, 705 of 714 bases (98%) identical to bases
5609-6322, and 176 of 176 bases (100%) identical to bases 6385-6560 of Homo sapiens mRNA for
KIAA1127 protein (GenBank Acc: AB032953) (Table 3L).
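
The base ranges quoted above are inclusive, so a range a-b spans b - a + 1 bases. The following minimal Python sketch (illustrative only; the helper name `span_length` is hypothetical) checks that each quoted subject range has the same length as the alignment length in the corresponding identity figure. That agreement is what one would expect if the subject strand carries no gaps in these alignments, which is an assumption here rather than something stated in the text.

```python
def span_length(start: int, end: int) -> int:
    """Number of bases in the inclusive coordinate range start-end."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# ((subject start, subject end), aligned bases) for the three KIAA1127 fragments
hits = [((1, 5538), 5538), ((5609, 6322), 714), ((6385, 6560), 176)]
consistent = [span_length(*rng) == aligned for rng, aligned in hits]
print(consistent)
```

All three ranges are consistent with the quoted alignment lengths under that assumption.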

Table 3L. BLASTN of FCTR3b, c, d, and e against Homo sapiens KIAA1127 mRNA (SEQ ID NO:64)

>GI|6329762|DBJ|AB032953.1|AB032953 HOMO SAPIENS MRNA FOR KIAA1127 PROTEIN, PARTIAL
CDS

LENGTH = 6560 



SCORE = 1.097E+04 BITS (5534), EXPECT = 0.0
IDENTITIES = 5537/5538 (99%)
STRAND = PLUS / PLUS



QUERY : 


3267 


CACCTTCTTTAGTGCTGCCCCTGGGCAGAATCCCATCGTGCCTGAGACCCAGGTTCTTCA 

1 1 ! i 1 1 1 1! 1 M 1 1 i 1 i M 1 i M IE 1 1 1 1 1 ! 1 ! M ! 1 i ! 1 1 I M M 1 ! It 1 i 1 1 I M 1 II 


3326 


SBJCT : 


1 


1 M M II 1 II f 1 1 I M ! 1 M 1 M 1 1 1 I 11 1 11 1 1 t 1 1 11 i 11 1 i 11 1 M 1 1 M t 1! 1 11 1 
CACCTTCTTTAGTGCTGCCCCTGGGCAGAATCCCATCGTGCCTGAGACCCAGGTTCTTCA 


60 


QUERY : 


3327 


TGAAGAAATCGAGCTCCCTGGTTCCAATGTGAAACTTCGCTATCTGAGCTCTAGAACTGC 


3386 


SBJCT: 


61 


iiiimiiiiiiiiiMiiiimiiimiiiiiimiiiiiiiiMimiiiiii 

TGAAGAAATCGAGCTCCCTGGTTCCAATGTGAAACTTCGCTATCTGAGCTCTAGAACTGC 


120 


QUERY : 


3387 


AGGGTAO^GTCACTGCTGAAGATCACCATGACCCAGTCCACAGTGCCCCTGAACCTCAT 

! ! 1 1 1 1 1 1 1 1 1 M M M ! M M i 1 1 ! ! i 1 ! M ! 1 ! 1 1 1 1 M I 1 1 1 1 1 I 1 1 1 1 1 II I 1 1 1 I 
1 1 1 II 1 l 1 1 l 1 M M M M M M 1 M 1 1 1 1! 11 1 M 1 1 ! M 1 1 M 1 1 II II 11 1 II 1 1 1 I 

AGGGTACAAGTCACTGCTGAAGATCACCATGACCCAGTCCACAGTGCCCCTGAACCTCAT 


3446 


SBJCT : 


121 


180 


QUERY : 


3447 


TAGGGTTCACCTGATGGTGGCTGTCGAGGGGCATCTCTTCCAGAAGTCATTCCAGGCTTC 
1 11 1 1 1 1 1 1 1 1 1 i M 1 1 1 1 1 1 1 1 1 i 1 II 1 M 1 1 M II II i 1 i 1 1 1 1 1 1 1 1 1 1 1 II M 1 1 1 


3506 


SBJCT : 


181 


1 1 1 1 1 1 1 1 1 1 1 1 f 1 i 1 1 1 1 1 1 1 II I 1 M 1 1 M M 1 II 1 1 M 1 1 I M M 1 1 I 1 M M i 1 M 
TAGGGTTCACCTGATGGTGGCTGTCGAGGGGCATCTCTTCCAGAAGTCATTCCAGGCTTC 


240 


QUERY : 


3507 


TCCCAACCTGGCCTCCACCTTCATCTGGGACAAGACAGATGCGTATGGCCAAAGGGTGTA 

II 1 1 1 1 1 1 1 M II 1 1 1 II 1 1 1 1 1 II 1 1 1 i 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 M 1 II 1 1 


3566 


SBJCT : 


241 


M 1 II 1 1 M M 1 1 i 1 1 II M M 1 1 S 1 I 1 1 M M 1 1 1 1 M M 1 II 1 1 i 1 I 1 1 ! 1 I 1 1 1 1 1 
TCCCAACCTGGCCTACACCTTC^TCTGGGACAAGACAGATGCGTATGGCCAAAGGGTGTA 


300 


QUERY : 


3567 


TGGACTCTCAGATGCTGTTGTGTCTGTCGGGTTTGAATATGAGACCTGTCCCAGTCTAAT 


3626 


SBJCT: 


301 


MllllllMIIIIMIMIIMMMIMIIIIMillMMIIIMIMIIMIIMI 

TGGACTCTCAGATGCTGTTGTGTCTGTCGGGTTTGAATATGAGACCTGTCCCAGTCTAAT 


360 


QUERY: 


3627 


T C T CTGGGAGAAAAGGACAG CC CT CCT T CAGGGATT CGAG CTGGACC C CT C CAAC CT CGG 

1 1 1 1 1 1 II 1 II 1 1 1 1 1 1 1 II 1 1 1 1 II 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
1 II 1 1 M II II 1 1 1 II I 1 M 11 1 1 M 1 1 M M 1 1 1 1 I II 1 1 I I 1 1 M ! 1 1 1 M 1 1 1 1 1 1 1 

TCTCTGGGAGAAAAGGACAGCCCTCCTTCAGGGATTCGAGCTGGACCCCTCCAACCTCGG 


3686 


SBJCT: 


361 


420 


QUERY : 


3687 


TGGCTGGTCCCTAGACAAACACCACATCCTCAATGTTAAAAGTGGAATCCTACACAAAGG 


3746 


SBJCT : 


421 


IIIMIIIIMIIIMIIIIIIMIIIIIIMIMIIIMMMMIMIMIIMMM 

TGGCTGGT CC CTAGACAAACACCACATCCTCAATGTTAAAAGTGGAAT C CTACACAAAGG 


480 


QUERY : 


3747 


CACTGGGGAAAACCAGTTCCTGACCCAGCAGCCTGCCATCATCACCAGCATCaTGGGCAA 


3806 


SBJCT : 


481 


IMIMMIIIIIIIIIMIIMMIIIIIMIIIIIIIIMMMIIIIMIIIIIIll 

CACTGGGGAAAACCAGTTCCTGACCCAGCAGCCTGCCATCATCACCAGCATCATGGGCAA 


540 


QUERY : 


3807 


TGGTCGCCGCCGGAGCATTTCCTGTCCCAGCTGCAACGGCCTTGCTGAAGGCAACAAGCT 


3866 


SBJCT : 


541 


IMMIIIIIIIIIIIIMIMIIMMIIMIIIIIIIIIIIIIMMMIIIIIIMI 

TGGTCGCCGCCGGAGCATTTCCTGTCCCAGCTGCAACGGCCTTGCTGAAGGCAACAAGCT 


600 


QUERY : 


3867 


GCTGGCCCCAGTGGCTCTGGCTGTTGGAATCGATGGGAGCCTCTATGTGGGTGACTTCAA 

IMIIMIMIIMlMIIIIIIIIMIMMIIIMIIMiMIIIIIIIMIIIIMI 

GCTGGCCCCAGTGGCTCTGGCTGTTGGAATCGATGGGAGCCTCTATGTGGGTGACTTCAA 


3926 


SBJCT : 


601 


660 


QUERY : 


3927 


TTACATC CGACG CAT CTTT CC CT CTCGAAATGTGAC CAGCAT C TTGGAGTTACGAAATAA 


3986 


SBJCT : 


661 


IIIMMllMIIIIIIIIMIIIIIIIIIMIMIMMIIIIMMIMMIIIIMI 

TTACATCCGACGCATCTTTCCCTCTCGAAATGTGACCAGCATCTTGGAGTTACGAAATAA 


720 


QUERY : 


3987 


AGAGTTTAAACATAGCAACAACCCAGCACACAAGTACTACTTGGCAGTGGACCCCGTGTC 

MMIMMMIIIIIIMMMMIIMMMIIIIIIMMIIIMIIIMMMMI 

AGAGTTTAAACATAGCAACAACCCAGCACACAAGTACTACTTGGCAGTGGACCCCGTGTC 


4046 


SBJCT : 


721 


780 



QUERY: 404 7 CGGCTCGCTCTACGTGTCCGACACCAACAGCAGGAGAATCTACCGCGTCAAGTCTCTGAG 4106 




MMIMIIIIIMIIimilMIMMllllllMIIIIMIMMIIMMIIIMI 

SBJCT: 781 CGGCTCGCTCTACGTGTCCGACACCAACAGCAGGAGAATCTAGCGCGTCAAGTCTCTGAG 840 
QUERY: 4107 TGGAACCAAAGACCTGGCTGGGAATTCGGAAGTTGTGGCAGGGACGGGAGAGCAGTGTCT 4166 

IIIIIMMIIIIIIIIIIMMIIIIIIIIIIIIIIIIIIIIIIIIIMIIMIIIIM 



IIIIIMIMIIIIIMIllllllllMIIIIIIMIMIMIIIIMIIIIMIIill 

^CCCTTTGATGAAGCCCGCTGCGGGGATGGAGGGAAGGCCATAGATGCAACCCTGATGAG 960 
rCCGAGAGGTATTGCAGTAGACAAGAATGGGCTCATGTACTTTGTCGATGCCACCATGAT 42 86 

IIIIIIIIIMIIIIIIIIIIIIIIIIIIMMIMMIIilMIIIIIMlllllMI 

rCCGAGAGGTATTGCAGTAGACAAGAATGGGCTCATGTACTTTGTCGATGCCACCATGAT 1020 
:CGGAAGGTTGACCAGAATGGAATCATCTCCACCCTGCTGGGCTCCAATGACCTCACTGC 4346 

IIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIMIIIIMIMIIIMIimi 

:CGGAAGGTTGACCAGAATGGAATCATCTCCACCCTGCTGGGCTCCAATGACCTCACTGC 1080 
:GTCCGGCCGCTGAGCTGTGATTCCAGCATGGATGTAGCCCAGGTTCGTCTGGAGTGGCC 4406 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 i I i 1 1 1 1 f I i 1 1 1 i 1 1 1 E 1 1 1 1 i 1 1 1 T 1 1 1 1 

:GTCCGGCCGCTGAGCTGTGATTCCAGCATGGATGTAGCCCAGGTTCGTCTGGAGTGGCC 1140 
^ACAGACCTTGCTGTCAATCCCATGGATAACTCCTTGTATGTTCTAGAGAACAATGTCAT 4466 

! M II I M I M M 1 1 ! M 1 1 II 1 1 ! I ! I M I II ! 1 i 1 1 M ! 1 1 M I ! I M 1 1 1 1 1 1 1 1 1 

^ACAGAC CTTG CTGTCAATC C CATGGATAACT CCTTGTATGTT CTAGAGAAGAATGT CAT 12 00 
^C TTCGAAT CACCGAGAACCAC CAAGT CAGCAT CATTG CGGGACG C CCCATGCACTG C CA 4526 

IMIMMMIIIMMMIIIIIMIMMIIMIIIMIMIIIIIIIMIIMMII 

HCTT CGAAT CACCGAGAAC CAC CAAGTCAGCAT CATTG CGGGACGC C CCATGCACTGC CA 1260 
\GTTCCTGGCATTGACTACTCACTCAGCAAACTAGCCATTCACTCTGCCCTGGAGTCAGC 4586 

f 1 1 ! I M I M 1 1 1 1 11 M I ! 1 1 M ! t II 1 1 1 1 1 1 1 1 1 E I M I E 1 1 1 1 1 E { I M I M 1 1 ! I 

^GTTCCTGGCATTGACTACTCACTCAGCAAACTAGCCATTCACTCTGCCCTGGAGTCAGC 13 20 
IAGTGCCATTGCCATTTCTCACACTGGGGTCCTCTACATCACTGAGACAGATGAGAAGAA 4646 

I i M 1 1 M M M M M 11 1 1 1 1 M 11 M t M M I! 1 1 1 M M 1 1 M 1 1 i 1 1 1 ! f t i 1 1 1 1 

I!AGTGCCATTGCCATTTCTCACACTGGGGTCCTCTACATCACTGAGACAGATGAGAAGAA 13 80 
3ATTAACCGTCTACGCCAGGTAACAACCAACGGGGAGATCTGCCTTTTAGCTGGGGCAGC 4706 

IIMIMIIIIMMIIIIIIIIIIMMIIIIIIIIMIIIIMIIIIIMIIIIIMI 

jATTAACCGTCTACGCCAGGTAACAACCAACGGGGAGATCTGCCTTTTAGCTGGGGCAGC 1440 
^TCGGACTGCGACTGC^AAAACGATGTCAATTGCAACTGCTATTCAGGAGATGATGCCTA 4766 

I i 1 1 1 1 1 i 1 1 1 1 { f 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 f 1 1 1 1 1 i I i 1 1 1 1 1 i M I f i f i I f f I f f 1 1 r ! 

^TCGGACTGCGACTGCAAAAACGATGTCAATTGCAACTGCTATTCAGGAGATGATGCCTA 1500 
ZG CGACTGATGC CAT CTTGAATT C CC CAT CAT C CTTAG CTGTAG CT C CAGATGGTAC CAT 4826 

1 1 1 [ 1 1 1 i 1 1 M 1 1 f 1 1 1 1 ! 1 1 1 1 1 1 i 1 1 1 M 1 1 i 1 1 1 1 1 E 1 1 ! } 1 1 1 M 1 i f i 1 1 1 M I 

^GCGACTGATGCCATCTTGAATTCCCCATCATCCTTAGCTGTAGCTCCAGATGGTACCAT 1560 
rTACATTGCAGACCTTGGAAATATTCGGATCAGGGCGGTCAGCAAGAACAAGCCTGTTCT 4886 

M I M M I M 11 1 1 M M ! M 1 1 f f i M I ! I i I M ! 11 ! M I M i I M I M M M 1 1 1 i i 

rTACATTGCAGACCTTGGAAATATTCGGATCAGGGCGGTCAGCAAGAACAAGCCTGTTCT 162 0 
rAATGCCTTCAACCAGTATGAGGCTGCATCCCCCGGAGAGCAGGAGTTATATGTTTTCAA 4946 

1 1 1 f 1 1 [ 1 1 1 1 1 1 f 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 f E i 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 E I f 1 1 1 

rAATGCCTTCAACCAGTATGAGGCTGCATCCCCCGGAGAGCAGGAGTTATATGTTTTCAA 1680 
^GCTGATGGCATCCACCAATACACTGTGAGCCTGGTGACAGGGGAGTACTTGTACAATTT 5006 

MIMIMMIIIIIMMIIIIIIMMIIIMIIIMIIIIIIMIIIMMMIIM 

^GCTGATGGCATCCACCAATACACTGTGAGCCTGGTGACAGGGGAGTACTTGTACAATTT 174 0 
^ACATATAGTACTGACAATGATGTCACTGAATTGATTGACAATAATGGGAATTCCCTGAA 5066 

IIIIIMIIIIMIIIIIMMIIMIIIIIIIMilllMIIIIMIIIIIIIIIIlll 

2ACATATAGTACTGACAATGATGTCACTGAATTGATTGACAATAATGGGAATTCCCTGAA 1800 
3ATCCGTCGGGACAGCAGTGGCATGCCCCGTCACCTGCTCATGCCTGACAACCAGATCAT 5126 

MMIIIIIIIIIIIIMMIlllllMMIMIIillMllMMIUllllllMIII 

ZAT C CGT CGGGACAG CAGTGG CATGC CC CG T CAC CTGCT CATGC CTGACAAC CAGAT CAT 1860 




SBJCT : 


841 


QUERY: 


4167 


SBJCT : 


901 


QUERY: 


4227 


SBJCT : 


961 


QUERY: 


4287 


SBJCT : 


1021 


QUERY : 


4347 


SBJCT : 


1081 


QUERY : 


4407 


SBJCT : 


1141 


QUERY : 


4467 


SBJCT : 


1201 


QUERY : 


4527 


SBJCT : 


1261 


QUERY : 


4587 


SBJCT : 


1321 


QUERY: 


4647 


SBJCT : 


1381 


QUERY: 


4707 


SBJCT : 


1441 


QUERY: 


4767 


SBJCT : 


1501 


QUERY : 


4827 


SBJCT : 


1561 


QUERY : 


4887 


SBJCT : 


1621 


QUERY : 


4947 


SBJCT : 


1681 


QUERY : 


5007 


SBJCT : 


1741 


QUERY : 


5067 


SBJCT : 


1801 






QUERY : 


5127 


SB JCT : 


1861 


QUERY : 


5187 


SB JCT : 


1921 


QUERY : 


5247 


SB JCT : 


1981 


QUERY: 


5307 


SBJCT: 


2041 


QUERY : 


5367 


SBJCT : 


2101 


QUERY: 


5427 


SBJCT : 


2161 


QUERY: 


5487 


SBJCT : 


2221 


QUERY: 


5547 


SBJCT : 


2281 


QUERY : 


5607 


SBJCT : 


2341 


QUERY : 


5667 


SBJCT : 


2401 


QUERY : 


5727 


SBJCT : 


2461 


QUERY : 


5787 


SBJCT : 


2521 


QUERY: 


5847 


SBJCT : 


2581 


QUERY : 


5907 


SBJCT : 


2641 


QUERY: 


5967 


SBJCT : 


2701 


QUERY : 


6027 


SBJCT : 


2761 


QUERY : 


6087 


SBJCT : 


2821 


QUERY: 


6147 



IIIIIIIIIMMMIIIIIMMMIIMMMMIMIIMMIIIIIMIIIIIII 

^CCCTCACCGTGGGCACCAA-TGGAGGCCTCAAAGTCGTGTCCACACAGAACCTGGAGCT 
3GTCTCATGACCTATGATGGCAACACTGGGCTCCTGGCCACCAAGAGCGATGAAACAGG 

I M M I ! I M M I M M t M 1 i [ it 1 1 1 11 1 1 M M E i 1 M 1 1 M 1 1 M i 1 1 M 11 1 j I 

jGTCTCATGACCTATGATGGCAACACTGGGCTCCTGGCCACCAAGAGCGATGAAACAGG 
rGGACGACTTTCTATGACTATGACCACGAAGGCCGCCTGACCAACGTGACGCGCCCCAC 

IIMMMIIMIMIIIMIIIIIMIIIIIIIIIIIIMIMIIMMIIIIIIIII 

rGGACGACTTTCTATGACTATGACCACGAAGGCCGCCTGACCAACGTGACGCGCCCCAC 
3GGGTGGTAACCAGTCTGCACCGGGAAATGGAGAAATCTATTACCATTGACATTGAGAA 

15 ~ MMMIIMM MIMIMMIIMMMMMIM MIMMMMMMMMIM 

aGGGTGGTAACCAGTCTGCACCGGGAAATGGAGAAATCTATTACCATTGACATTGAGAA 
I* CCAACCGTGATGATGACGT CACTGTCAT CAC CAACCT CTCTT CAGTAGAGG C CT C CTA 

M M:MM!IM MMMIIMIMMIMMIIIM MMIMIMMII MMIM 

TCCAACCGTGATGATGACGTCACTGTCATCACCAACCTCTCTTCAGTAGAGGCCTCCTA 
^CAGTGGTACAAGATCAAGTTCGGAACAGCTACCAGCTCTGTAATAATGGTACCCTGAG 

1 1 L 1 1 1 i i 1 1 i 1 1 1 i 1 1 i I i 1 1 1 i 1 1 1 1 1 i 1 1 1 1 i 1 1 1 1 E ! 1 1 1 1 M 1 1 f f 1 1 f i I ) i I 

^CAGTGGTACAAGATCAAGTTCGGAACAGCTACCAGCTCTGTAATAATGGTACCCTGAG 
GTGATGTATGCTAATGGGATGGGTATCAGCTT CCACAG CGAGC C C CATGT CCTAG CGGG 

1 1 1 i 1 1 1 i 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 i 1 1 f i f ! 1 1 1 1 1 1 1 1 1 1 1 i 1 1 ! I f i I i 1 1 1 

GTGATGTATGCTAATGGGATGGGTATCAGCTTCCACAGCGAGCCCCATGTCCTAGCGGG 
ACCATCACCCCCACCATTGGACGCTGCAACATCTCCCTGCCTATGGAGAATGGCTTAAA 

1 1 1 1 1 [ I M 1 1 1 1 1 i 1 1 1 T 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 E 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 

ACCATCACCCCCACCATTGGACGCTGCAACATCTCCCTGCCTATGGAGAATGGCTTAAA 
TCCATTGAGTGGCGCCTAAGAAAGGAAC^GATTAAAGGCAAAGTCACCATCTTTGGCAG 

35 ~ 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 II 1 1 1 1 1 M 1 1 1 1! 1 1 1 1 ! M ! 1 1 1 1 1 ! 1 1 i 1 1 1 1 1 1 1 ! 1 1 1 

:CATTGAGTGGCGCCTAAGAAAGGAACAGATTAAAGGCAAAGTCACCATCTTTGGCAG 
^GCTCCGGGTCCATGGAAGAAATCTCTTGTCCATTGACTATGATCGAAATATTCGGAC 

1 1 M I i M 1 1 1 1 1 1 1 1 i I M 1 1 1 1 1 1 M 1 1 ; I M M 1 1 1 1 1 1 

^GCTCCGGGTCCATGGAAGAAATCTCTTGTCCATTGACTATGATCGAAATATTCGGAC 
\AAAGATCTATGATGACCACCGGAAGTTCACCCTGAGGATCATTTATGACCAGGTGGG 

MMIM IMMMMMMMMMMIM MMMM IMMMMMM I 

\AAAGATCTATGATGACCACCGGAAGTTCACCCTGAGGATCATTTATGACCAGGTGGG 
jCCCCTTCCTCTGGCTGCCCAGCAGCGGGCTGGCAGCTGTCAACGTGTCATACTTCTT 

MIMIMMIM MIMMIMMMIMMM IMMMMIMI'MMIiMM 

jCCCCTTCCTCTGGCTGCCCAGCAGCGGGCTGGCAGCTGTCAACGTGTCATACTTCTT 
\TGGGCGCCTGGCTGGGCTTCAGCGTGGGGCCATGAGCGAGAGGACAGACATCGACAA 

M MM! MMIM IMMINIMMMMMMMMMMMMMMMM 

^TGGGCGCCTGGCTGGGCTTCAGCGTGGGGCCATGAGCGAGAGGACAGACATCGACAA 
^AGGCCGCATCGTGTCCCGCATGTTCGCTGACGGGAAAGTGTGGAGCTACTCCTACCT 

55 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 M 1 1 M 1 1 1 1 1 1 M II 1 1 1 1 1 II M 1 1 1 1 1 1 1 1 1 1 1 1 1 

^AAGGCCGCATCGTGTCCCGCATGTTCGCTGACGGGAAAGTGTGGAGCTACTCCTACCT 
jACAAGTCCATGGTCCTCCTGCTTCAGAGCCAACGTCAGTATATATTTGAGTATGACTC 

IIIIIIIIIMIillllMllllllllllllllllllillllMIIMIIIIIIlllll 

3ACAAGTCCATGGTCCTCCTGCTTCAGAGCCAACGTCAGTATATATTTGAGTATGACTC 
rCTGACCGCCTCCTTGCCGTCACCATGCCCAGCGTGGCCCGGCACAGCATGTCCACACA 

MMMMIIIIIIIMIIIIIIIIIIIIIIIIIMIIMMMIMIMIIIIIIMI 

rCTGACCGCCTCCTTGCCGTCACCATGCCCAGCGTGGCCCGGCACAGCATGTCCACACA 
^CCTCCATCGGCTACATCCGTAATATTTACAACCCGCCTGAAAGCAATGCTTCGGTCAT 

MMMMIIIIMIIIIIIIIIIMMMIIIIIIIIMIIMMIIIIIIIMIIII 
lllllllllilillMIIMIIIIMIMIIIIIIIIIIMMMIIIIIIIIIIIIM!

1 CTTTGACTACAGTGATGACGGCCGCATCCTGAAGACCTCCTTTTTGGGCACCGGACGCC^ 







SBJCT : 


2881 


CTTTGACTACAGTGATGACGGCCGCATCCTGAAGACCTCCTTTTTGGGCACCGGACGCCA 


2940 






QUERY: 


6207 


GGTGTTCTACAAGTATGGGAAACTCTCCAAGTTATCAGAGATTGTCTACGACAGTACCGC 
I M 1 i i i I ! 1 M I 1 i 1 1 i M 1 M i 1 1 1 1 1 M 1 1 I It II i ! 1 M II ! 1! M 1 i 1 M i M 1 i 


6266 






SBJCT : 


2941 


M 1 1 1 1 M M 1 M 1 M 1 I 1 i 1 1 1 1 II 1 II II It II 1 1 1 M ! I! 1 M 1! 1 ! M M i i f M i 

GGTGTTCTACAAGTATGGGAAACTCTCCAAGTTATCAGAGATTGTCTACGACAGTACCGC 


3000 






QUERY: 


6267 


CGTCACCTTCGGGTATGACGAGACCACTGGTGTCTTGAAGATGGTCAACCTCCAAAGTGG 


6326 






SBJCT : 


3001 


IIIMIIIMIIIMIIIIIMlllllllllllMIIIMIMIIIIIIIIIMIMIII 

CGTCACCTTCGGGTATGACGAGACCACTGGTGTCTTGAAGATGGTCAACCTCCAAAGTGG 


3060 






QUERY : 


6327 


GGGCTTCTCCTGCACCATCAGGTACCGGAAGATTGGCCCCCTGGTGGACAAGCAGATCTA 

M ! 1 ! 1 ! II 1 1 1 1 1 1 1 M M 11 1 1 1 M 1 ! 1 1 II M M I 1 ! 1 1 1 1 i 1 1 1 M 1! 1 i 1 1 1 i 1 I 
1 1 1 1 II M 1 II 1 1 1 1 1 II 1 II II I M 1 1 1 1 1 1 II II II 1 1 M 1 1 M I 1 i II M M 1 M 1 1 

GGGCTTCTCCTGCACCATCAGGTACCGGAAGATTGGCCCCCTGGTGGACAAGCAGATCTA 


6386 






SBJCT : 


3061 


3120 




QUERY : 


6387 


CAGGTTCTCCGAGGAAGGCATGGTCAATGCCAGGTTTGACTACACCTATCATGACAACAG 
M it f II t I II i I II f 1 M i 1 M 1 i I t M II 1 1 I 1 1 ! ! 1 1 1 t M 1 ! 1 II M M ! 1 It 1 M 


6446 






SBJCT : 


3121 


1 1 1 II 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 M M 1 M M i M M II 1 M M 1 M 1 1 1 II 

CAGGTTCTCCGAGGAAGGCATGGTCAATGCCAGGTTTGACTACACCTATCATGACAACAG 


3180 






QUERY: 


6447 


CTTCCGCATCGCAAGCATCAAGCCCGTCATAAGTGAGACTCCCCTCCCCGTTGACCTCTA 

t I 1 1 I I ! 1 1 M M 1 1 1 t 1 M i 1 1 I 1 1 1 1 1 1 1 I 1 I 1 1 1 t 1 1 1 1 1 ! 1 1 1 1 M 1 1 M t 1 II 1 1 


6506 






SBJCT : 


3181 


1 1 II M II 1 II 1 1 II 1 II II II 1 II II 1 1 1 1 II 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 M M 1 1 1 1 I 1 1 

CTTCCGCATCGCAAGCATCAAGCCCGTCATAAGTGAGACTCCCCTCCCCGTTGACCTCTA 


3240 






QUERY : 


6507 


CCGCTATGATGAGATTTCTGGCAAGGTGGAACACTTTGGTAAGTTTGGAGTCATCTATTA 
M M i M 1 M M 1 M 1 1 ! 1 1 M M 1 1 1 1 1 1 II 1 1 1 1 ! II 1! 1 M I ! M 1 1 1 1 1 1 1 1 M M 


6566 




SBJCT : 


3241 


I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 II 1 1 II 1 1 M I 1 1 I 1 M 1 1 I II 1 1 1 M 1 i 1 
CCGCTATGATGAGATTTCTGGCAAGGTGGAACACTTTGGTAAGTTTGGAGTCATCTATTA 


3300 






QUERY : 


6567 


TGAC^TC^CC^GATCATCACC^CTGCCGTGATGACCCTCAGCAAACACTTCGACACCCA 
1 I 1 l 1 l 1 1 1 M M M M M II M M M 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 ! 1 1 M 1 i M M M 1 1 M i 


6626 




SBJCT: 


3301 


1 1 1 1 1 II 1 1 1 1 1 1 1 M II II 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 II I 1 1 I 1 i II 1 1 1 11 
TGACATCAACCAGATCATCACCACTGCCGTGATGACCCTCAGCAAACACTTCGACACCCA 


3360 






QUERY : 


6627 


TGGGCGGATCAAGGAGGTCCAGTATGAGATGTTCCGGTCCCTCATGTACTGGATGACGGT 
1 [ 1 1 [ M i t M 1 1 1 1 II 1 M 1 1 1 M M 1 i M M i 1 M 1 1 1 1 I 1 1 1 1 1 1 1 M M M M 1 M 


6686 






SBJCT : 


3361 


1 1 1 1 II II 1 II 1 II 1 II II 1 1 1 1 1 M M 1 1 1 II 1 II II 1 1 1 1 M 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 

TGGG CGGAT CAAGGAGGT C CAGTATGAGATGTT C CGGT C C CT CATGTACTGGATGACGGT 


3420 




QUERY : 


6687 


GCAATATGACAGCATGGGCAGGGTGATCAAGAGGGAGCTAAAACTGGGGCCCTATGCCAA 
II l 1 ! 1 l M M 11 II 1 1 U i II i II 1 1 M M 11 1 1 ! ! M 1 1 1 1 M M 1 M 1 1 1 M 1 1 1 11 


6746 






SBJCT : 


3421 


1 1 1 1 1 1 1 1 1 11 1 II 1 I 1 1 II 1 1 II 1 1 1 1 1 II 1 M 1 M M M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

G CAATATGACAGCATGGG CAGGGTGATCAAGAGGGAG CTAAAACTGGGGC CCTATG C CAA 


3480 






QUERY: 


6747 


TACCACGAAGTACACCTATGACTACGATGGGGACGGGCAGCTCCAGAGCGTGGCCGTCAA 
1 M 1 1 ! II \ 1 1 1 M 1 1 M f 1 M 1 1 M 1 II 1 1 1 1 1 1 1 It II II 1 1 M M 1 1 M II 1 t M t 1 


6806 






SBJCT : 


3481 


1 II 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 11 1 \ I! 1 1 1 ! 1 1 1 1 Mil! 
TACCACGAAGTACACCTATGACTACGATGGGGACGGGCAGCTCCAGAGCGTGGCCGTCAA 


3540 






QUERY : 


6807 


TGAC CG C C CGAC CTGGCGCTACAGC TATGAC CTTAATGGGAAT CTC CACT TACTGAAC C C 
1 1 1 1 I 1 i 1 1 1 1 I i I i J 1 i II 1 1 11 t 11 I I I I 1 1 I I 1 1 1 1 I M 1 II 1 1 I 1 1 1 1 1 II II ! 1 1 


6866 




SBJCT : 


3541 


1 1 1 1 II II 1 1 II 1 1 1 II 1 1 1 1 II 1 II II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II II 1 i 1 1 1 1 1 1 
TGACCGCCCGACCTGGCGCTACAGCTATGACCTTAATGGGAATCTCCACTTACTGAACCC 


3600 






QUERY : 


6867 


AGGCAACAGTGTGCGCCTCATGCCCTTGCGCTATGACCTCCGGGATCGGATAACCAGACT 
II M 1 1 i II 1 1 M 1 II 1 M t 1 1 11 1 1 1 I I M II II 1 1 M M II 1 1 M M f M 1 i 1 1 II 1 i 


6926 






SBJCT : 


3601 


1 1 II II II M II 1 1 1 1 1 II M M 1 II 1 II 1 1 1 M 1 1 1 1 II 1 1 1 1 1 M 1 1 1 i 1 1 1 1 1 1 1 1 1 
AGGCAACAGTGTGCGCCTCATGCCCTTGCGCTATGACCTCCGGGATCGGATAACCAGACT 


3660 






QUERY : 


6927 


CGGGGATGTGCAGTACAAAATTGACGACGATGGCTATCTGTGCCAGAGAGGGTCTGACAT 
1 i 1 l M 1 II M 1 II M 1 M 1 I M 1 1 II 1 II 1 1 1 1 1 1 1 I 1 t 1 1 1 1 M t 1 t 1 II M t It 1 I 1 


6986 






SBJCT : 


3661 


1 II 1 1 1 1 1 1 II J 1 1 II II 1 1 1 II 1 1 1 1 II 1 1 II 1 1 I 1 M 1 1 ! 1 II 1 1 1 II 1 1 1 i 1 1 I 1 1 1 
CGGGGATGTGCAGTACAAAATTGACGACGATGGCTATCTGTGCCAGAGAGGGTCTGACAT 


3720 




QUERY: 


6987 


CTTCGAATACAATTCCAAGGGCCTCCTAACAAGAGCCTACAACAAGGCCAGCGGGTGGAG 
1 M t 1 ! 1 1 t 1 M M ! M 1 1! 1 ! 1 M ! 1 1 1 1 1 M 1 M 1 M 1 1 M 1 M 1 1 1 i II 1 It 1 i 1 It 


7046 






SBJCT : 


3721 


1 M 1 1 M M M 1 1 1 1 1 1 1 1 II 1 1 1 1 II 1 1 II 1 M 11 1 1 1 1 1 1 II M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

CTTCGAATACAATTCCAAGGGCCTCCTAACAAGAGCCTACAACAAGGCCAGCGGGTGGAG 


3780 






QUERY : 


7047 


TGTCCAGTACCGCTATGATGGCGTAGGACGGCGGGCTTCCTACAAGACCAACCTGGGCCA 


7106 






SBJCT : 


3781 


IlilMIIMIIIIIIIMIIMIIIMIIIIIIMIIMIIIIMIIIIIMIIIIIII 

TGTCCAGTACCGCTATGATGGCGTAGGACGGCGGGCTTCCTACAAGACCAACCTGGGCCA 


3840 






QUERY : 


7107 


CCACCTGCAGTACTTCTACTCTGACCTCCACAACCCGACGCGCATCACCCATGTCTACAA 

M t 1 1 1 M 1 1 11 1 1 M 1 1 1 1 M 1 1 1 1 11 M 1 1 I 1 1 1 M 1 1 1 1 1 It 1 t 1 M 1 1 1 1 1 1 M 1 1 
1 1 1 1 1 1 II 1 1 1 1 1 1 1 II 1 1 1 1 1 II M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 M 1 1 I 1 1 1 

CCACCTGCAGTACTTCTACTCTGACCTCCACAACCCGACGCGCATCACCCATGTCTACAA 


7166 




SBJCT : 


3841 


3900 






QUERY: 


7167 


TCACTCCAACTCGGAGATTACCTCACTGTACTACGACCTCCAGGGCCACCTCTTTGCCAT 


7226 






SBJCT : 


3901 


IIIMlMMMIMIMIIIIIIllllMIIMllllllltMllllllllllllllll 

T CACT CCAACTCGGAGATTACCTCACTGTACTACGACCTCCAGGGCCACCTCTTTGC CAT 


3960 






QUERY: 7227 GGAGAGCAGCAGTGGGGAGGAGTACTATGTTGCCTCTGATAACACAGGGACTCCTCTGGC 7286
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
SBJCT: 3961 GGAGAGCAGCAGTGGGGAGGAGTACTATGTTGCCTCTGATAACACAGGGACTCCTCTGGC 4020

QUERY: 7287 TGTGTTCAGCATCAACGGCCTCATGATCAAACAGCTGCAGTACACGGCCTATGGGGAGAT 7346
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
SBJCT: 4021 TGTGTTCAGCATCAACGGCCTCATGATCAAACAGCTGCAGTACACGGCCTATGGGGAGAT 4080

QUERY: 7347 TTATTATGACTCCAACCCCGACTTCCAGATGGTCATTGGCTTCCATGGGGGACTCTATGA 7406
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
SBJCT: 4081 TTATTATGACTCCAACCCCGACTTCCAGATGGTCATTGGCTTCCATGGGGGACTCTATGA 4140

QUERY: 7407 CCCCCTGACCAAGCTGGTCCACTTCACTCAGCGTGATTATGATGTGCTGGCAGGACGATG 7466
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
SBJCT: 4141 CCCCCTGACCAAGCTGGTCCACTTCACTCAGCGTGATTATGATGTGCTGGCAGGACGATG 4200

QUERY: 7467 GACCTCCCCAGACTATACCATGTGGAAAAACGTGGGCAAGGAGCCGGCCCCCTTTAACCT 7526
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
SBJCT: 4201 GACCTCCCCAGACTATACCATGTGGAAAAACGTGGGCAAGGAGCCGGCCCCCTTTAACCT 4260

QUERY: 7527 GTATATGTTCAAGAGCAACAATCCTCTCAGCAGTGAGCTAGATTTGAAGAACTACGTGAC 7586
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
SBJCT: 4261 GTATATGTTCAAGAGCAACAATCCTCTCAGCAGTGAGCTAGATTTGAAGAACTACGTGAC 4320






QUERY: 7587 AGATGTGAAAAGCTGGCTTGTGATGTTTGGATTTCAGCTTAGCAACATCATTCCTGGCTT 7646
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
SBJCT: 4321 AGATGTGAAAAGCTGGCTTGTGATGTTTGGATTTCAGCTTAGCAACATCATTCCTGGCTT 4380

QUERY: 7647 CCCGAGAGCCAAAATGTATTTCGTGCCTCCTCCCTATGAATTGTCAGAGAGTCAAGCAAG 7706
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
SBJCT: 4381 CCCGAGAGCCAAAATGTATTTCGTGCCTCCTCCCTATGAATTGTCAGAGAGTCAAGCAAG 4440
QUERY: 7707 TGAGAATGGACAGCTGATTACAGGTGTCCAACAGAGAACAGAGAGACATAACCAGGCCTT 7766
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
SBJCT: 4441 TGAGAATGGACAGCTGATTACAGGTGTCCAACAGAGAACAGAGAGACATAACCAGGCCTT 4500
QUERY: 7767 CATGGCTCTGGAAGGACAGGTCATTACTAAAAAGCTCCACGCCAGCATCCGAGAGAAAGC 7826
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
SBJCT: 4501 CATGGCTCTGGAAGGACAGGTCATTACTAAAAAGCTCCACGCCAGCATCCGAGAGAAAGC 4560

QUERY: 7827 AGGTCACTGGTTTGCCACCACCACGCCCATCATTGGCAAAGGCATCATGTTTGCCATCAA 7886
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
SBJCT: 4561 AGGTCACTGGTTTGCCACCACCACGCCCATCATTGGCAAAGGCATCATGTTTGCCATCAA 4620



QUERY: 7887 AGAAGGGCGGGTGACCACGGGCGTGTCCAGCATCGCCAGCGAAGATAGCCGCAAGGTGGC 7946
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
SBJCT: 4621 AGAAGGGCGGGTGACCACGGGCGTGTCCAGCATCGCCAGCGAAGATAGCCGCAAGGTGGC 4680

QUERY: 7947 ATCTGTGCTGAACAACGCCTACTACCTGGACAAGATGCACTACAGCATCGAGGGCAAGGA 8006
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
SBJCT: 4681 ATCTGTGCTGAACAACGCCTACTACCTGGACAAGATGCACTACAGCATCGAGGGCAAGGA 4740

QUERY: 8007 CACCCACTACTTTGTGAAGATTGGCTCAGCCGATGGCGACCTGGTCACACTAGGCACCAC 8066
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
SBJCT: 4741 CACCCACTACTTTGTGAAGATTGGCTCAGCCGATGGCGACCTGGTCACACTAGGCACCAC 4800

QUERY: 8067 CATCGGCCGCAAGGTGCTAGAGAGCGGGGTGAACGTGACCGTGTCCCAGCCCACGCTGCT 8126
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
SBJCT: 4801 CATCGGCCGCAAGGTGCTAGAGAGCGGGGTGAACGTGACCGTGTCCCAGCCCACGCTGCT 4860

QUERY: 8127 GGTCAACGGCAGGACTCGAAGGTTCACGAACATTGAGTTCCAGTACTCCACGCTGCTGCT 8186
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
SBJCT: 4861 GGTCAACGGCAGGACTCGAAGGTTCACGAACATTGAGTTCCAGTACTCCACGCTGCTGCT 4920

QUERY: 8187 CAGCATCCGCTATGGCCTCACCCCCGACACCCTGGACGAAGAGAAGGCCCGCGTCCTGGA 8246
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
SBJCT: 4921 CAGCATCCGCTATGGCCTCACCCCCGACACCCTGGACGAAGAGAAGGCCCGCGTCCTGGA 4980

QUERY: 8247 CCAGGCGAGACAGAGGGCCCTGGGCACGGCCTGGGCCAAGGAGCAGCAGAAAGCCAGGGA 8306
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
SBJCT: 4981 CCAGGCGAGACAGAGGGCCCTGGGCACGGCCTGGGCCAAGGAGCAGCAGAAAGCCAGGGA 5040

QUERY: 8307 CGGGAGAGAGGGGAGCCGCCTGTGGACTGAGGGCGAGAAGCAGCAGCTTCTGAGCACCGG 8366
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
SBJCT: 5041 CGGGAGAGAGGGGAGCCGCCTGTGGACTGAGGGCGAGAAGCAGCAGCTTCTGAGCACCGG 5100

QUERY: 8367 GCGCGTGCAAGGGTACGAGGGATATTACGTGCTTCCCGTGGAGCAATACCCAGAGCTTGC 8426
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
SBJCT: 5101 GCGCGTGCAAGGGTACGAGGGATATTACGTGCTTCCCGTGGAGCAATACCCAGAGCTTGC 5160

QUERY: 8427 AGACAGTAGCAGCAACATCCAGTTTTTAAGACAGAATGAGATGGGAAAGAGGTAACAAAA 8486
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
SBJCT: 5161 AGACAGTAGCAGCAACATCCAGTTTTTAAGACAGAATGAGATGGGAAAGAGGTAACAAAA 5220






QUERY: 8487 TAATCTGCTGCCATTCCTTGTCTGAATGGCTCAGCAGGAGTAACTGTTATCTCCTCTCCT 8546
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
SBJCT: 5221 TAATCTGCTGCCATTCCTTGTCTGAATGGCTCAGCAGGAGTAACTGTTATCTCCTCTCCT 5280

QUERY: 8547 AAGGAGATGAAGACCTAACAGGGGCACTGCGGCTGGGCTGCTTTAGGAGACCAAGTGGCA 8606
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
SBJCT: 5281 AAGGAGATGAAGACCTAACAGGGGCACTGCGGCTGGGCTGCTTTAGGAGACCAAGTGGCA 5340

QUERY: 8607 AGAAAGCTCACATTTTTTGAGTTCAAATGCTACTGTCCAAGCGAGAAGTCCCTCATCCTG 8666
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
SBJCT: 5341 AGAAAGCTCACATTTTTTGAGTTCAAATGCTACTGTCCAAGCGAGAAGTCCCTCATCCTG 5400

QUERY: 8667 AAGTAGACTAAAGCCCGGCTGAAAATTCCGAGGAAAACAAAACAAACGAATGAATGAACA 8726
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
SBJCT: 5401 AAGTAGACTAAAGCCCGGCTGAAAATTCCGAGGAAAACAAAACAAACGAATGAATGAACA 5460

QUERY: 8727 GACACACACAATGTTCCAAGTTCCCCTAAAATATGACCCACTTGTTCTGGGTCTACGCAG 8786
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
SBJCT: 5461 GACACACACAATGTTCCAAGTTCCCCTAAAATATGACCCACTTGTTCTGGGTCTACGCAG 5520

QUERY: 8787 AAAAGAGACGCAAAGTGT 8804
            ||||||||||||||||||
SBJCT: 5521 AAAAGAGACGCAAAGTGT 5538



SCORE = 1362 BITS (687), EXPECT = 0.0
IDENTITIES = 705/714 (98%)
STRAND = PLUS / PLUS



QUERY: 8875 CACGGACCGATAAACAAAGAAGCGAAGATAAGAAAGAAGGCCTCATATCCAATTACCTCA 8934
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
SBJCT: 5609 CACGGACCGATAAACAAAGAAGCGAAGATAAGAAAGAAGGCCTCATATCCAATTACCTCA 5668

QUERY: 8935 CTCATTCACATGTGAGCGACACGCAGACATCCGCGAGGGCCAGCGTCACCAGACCAGCTG 8994
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
SBJCT: 5669 CTCATTCACATGTGAGCGACACGCAGACATCCGCGAGGGCCAGCGTCACCAGACCAGCTG 5728

QUERY: 8995 CGGGACAAACCACTCAGACTGCTTGTAGGACAAATACTTCTGACATTTTCGTTTAAGCAA 9054
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
SBJCT: 5729 CGGGACAAACCACTCAGACTGCTTGTAGGACAAATACTTCTGACATTTTCGTTTAAGCAA 5788

QUERY: 9055 ATACAGGTGCATTTAAAACACGACTTTGGGGGTGATTTGTGTGTAGCGCCTGGGGAGGGG 9114
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
SBJCT: 5789 ATACAGGTGCATTTAAAACACGACTTTGGGGGTGATTTGTGTGTAGCGCCTGGGGAGGGG 5848


QUERY : 


9115 


GGATAAAAGAGGAGGAGTGAGCACTGGAAATACTTTTTAAAGISnsnS^^ 9174 
in i i i i i i i i M I ; I I I i 1 1 M I I I I I 1 I 1 llillllll 


SBJCT : 


5849 


1 M f I E ! 1 E I 1 M 1 M 1 1 1 i 1 f 1 I I ! 1 M i M i 1 1 1 E 1 1 1 I 1 llillllll 
GGATAAAAGAGGAGGAGTGAGCACTGGAAATACTTTTTAAAGAAAAAAAAACATGAGGGA 


5908 


QUERY: 9175 ATAAAAGAAATTCCTATCAAAAATCAAAGTGAAATAATACCATCCAGCACTTAACTCTCA 9234
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
SBJCT: 5909 ATAAAAGAAATTCCTATCAAAAATCAAAGTGAAATAATACCATCCAGCACTTAACTCTCA 5968

QUERY: 9235 GGTCCCAACTAAGTCTGGCCTGAGCTAATTTATTTGAGCGCAGAGTGTAAAATTTAATTC 9294
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
SBJCT: 5969 GGTCCCAACTAAGTCTGGCCTGAGCTAATTTATTTGAGCGCAGAGTGTAAAATTTAATTC 6028

QUERY: 9295 AAAATGGTGGCTATAATCACTACAGATAAATTTCATACTCTTTTGTCTTTGGAGATTCCA 9354
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
SBJCT: 6029 AAAATGGTGGCTATAATCACTACAGATAAATTTCATACTCTTTTGTCTTTGGAGATTCCA 6088

QUERY: 9355 TTGTGGACAGTAATACGCAGTTACAGGGTGTAGTCTGTTTAGATTCCGTAGTTCGTGGGT 9414
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
SBJCT: 6089 TTGTGGACAGTAATACGCAGTTACAGGGTGTAGTCTGTTTAGATTCCGTAGTTCGTGGGT 6148

QUERY: 9415 ATCAGTTTCGGTAGAGGTGCAGCATCGTGACACTTTTGCTAACAGGTACCACTTCTGATC 9474
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
SBJCT: 6149 ATCAGTTTCGGTAGAGGTGCAGCATCGTGACACTTTTGCTAACAGGTACCACTTCTGATC 6208

QUERY: 9475 ACCCTGTACATACATGAGCCGAAAGGCACAATCACTGTTTCAGATTTAAAATTATTAGTG 9534
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
SBJCT: 6209 ACCCTGTACATACATGAGCCGAAAGGCACAATCACTGTTTCAGATTTAAAATTATTAGTG 6268

QUERY: 9535 TGTTTGTTTGGTCCAGAAACTGAGACAATCACATGACAGTCACCACGAGGAGAG 9588
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||
SBJCT: 6269 TGTTTGTTTGGTCCAGAAACTGAGACAATCACATGACAGTCACCACGAGGAGAG 6322






SCORE = 349 BITS (176), EXPECT = 2E-92
IDENTITIES = 176/176 (100%) 
STRAND = PLUS / PLUS 



QUERY: 9651 GTCTAATAAGAACTTTGGTACAGGAACTTTTTTGTAATATACATGTATGAATTGTTCATC 9710
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
SBJCT: 6385 GTCTAATAAGAACTTTGGTACAGGAACTTTTTTGTAATATACATGTATGAATTGTTCATC 6444

QUERY: 9711 GAGTTTTTATATTAATTTTAATTTGCTGCTAAGCAAAGACTAGGGACAGGCAAAGATAAT 9770
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
SBJCT: 6445 GAGTTTTTATATTAATTTTAATTTGCTGCTAAGCAAAGACTAGGGACAGGCAAAGATAAT 6504

QUERY: 9771 TTATGGCAAAGTGTTTAAATTGTTTATACATAAATAAAGTCTCTAAAACTCCTGTG 9826
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||
SBJCT: 6505 TTATGGCAAAGTGTTTAAATTGTTTATACATAAATAAAGTCTCTAAAACTCCTGTG 6560

In this search it was also found that the FCTR3b, c, d, and e nucleic acids had homology to
five fragments of Mus musculus mRNA for Ten-m2. The FCTR3 nucleic acid has 5498 of 6108 bases
(90%) identical to bases 2504-8610, 1095 of 1196 bases (91%) identical to bases 103-1298,
1000 of 1088 bases (91%) identical to bases 1420-2540, 81 of 89 bases (91%) identical to bases
8655-8743, and 30 of 32 bases (93%) identical to bases 7-38 of Mus musculus mRNA for Ten-m2
(Table 3M).
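
The percentages quoted for the five fragments can be rechecked from the base counts. The following is a minimal Python sketch, not part of the original disclosure: the function name `percent_identity` is illustrative, and truncating rather than rounding is an assumption inferred from the figures themselves (30 of 32 bases is 93.75%, yet it is reported as 93%).

```python
def percent_identity(identical: int, aligned: int) -> int:
    """Whole-number percent identity, truncated (assumed, not rounded)."""
    if aligned <= 0:
        raise ValueError("aligned length must be positive")
    # Integer division drops the fractional part, e.g. 93.75 -> 93.
    return (100 * identical) // aligned


# (identical bases, aligned bases) for the five fragments listed in the text
fragments = [(5498, 6108), (1095, 1196), (1000, 1088), (81, 89), (30, 32)]
percents = [percent_identity(i, n) for i, n in fragments]
print(percents)
```

Under that assumption the computed values agree with the 90%, 91%, 91%, 91% and 93% stated in the text.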

Table 3M. BLASTN of FCTR3b, c, d, and e against Mus musculus mRNA for Ten-m2 

mRNA (SEQ ID NO:65)

>GI|4760777|DBJ|AB025411.1|AB025411 MUS MUSCULUS MRNA FOR TEN-M2, COMPLETE CDS
LENGTH = 8797

SCORE = 7263 BITS (3664), EXPECT = 0.0
IDENTITIES = 5498/6108 (90%), GAPS = 1/6108 (0%)
STRAND = PLUS / PLUS



QUERY: 2578 GATGGCTGCCCTGACTTGTGCAACGGTAACGGGAGATGCACACTGGGTCAGAACAGCTGG 2637
            |||||||||||||| |||||||||||||||||||||||||||||||||||||||||||||
SBJCT: 2504 GATGGCTGCCCTGATTTGTGCAACGGTAACGGGAGATGCACACTGGGTCAGAACAGCTGG 2563

QUERY: 2638 CAGTGTGTCTGCCAGACCGGCTGGAGAGGGCCCGGATGCAACGTTGCCATGGAAACTTCC 2697
            |||||||||||||||||||||||||||||||| ||||||||||||||||||||||| |||
SBJCT: 2564 CAGTGTGTCTGCCAGACCGGCTGGAGAGGGCCTGGATGCAACGTTGCCATGGAAACCTCC 2623

QUERY: 2698 TGTGCTGATAACAAGGATAATGAGGGAGATGGCCTGGTGGATTGTTTGGACCCTGACTGC 2757
            || |||||||||||||||||||||||||||||||||||||| ||  ||||||||||||||
SBJCT: 2624 TGCGCTGATAACAAGGATAATGAGGGAGATGGCCTGGTGGACTGCCTGGACCCTGACTGC 2683
QUERY :


2 /DO 


rnn nn*vn n a n t n a r* n ptp t r* a n a a r A dC fTH r TC TClCCCiClClCtfVTC C CGGGACCCACTGGAC 


2817 


fin i ^ 

SBJCT : 


26 84 


Mill IMMMMMMMMMIMMMMMIMIMM IIIMIII Mill 

T , r»r , r , ^ar*?\nT , r*apr , r*TH^r t ar*Aapar^pr , TriPTr , T^ 


2743 


ATTODV . 

QUEKx : 


2818 




2877 


fin tv*'"p . 


2744 


lllllllllll II Mill II Mill MIMMMMMIMIMMM IMMI 

a t r* a ttp a n p a a p p t p a p a P a P, a PTftttP CTdC &Q r TCl A A (TF C CTTCT ATG AC CGC AT CAAG 


2803 


PvTTTTTDV • 

yUii*Kx : 




z-frri T p p p a p/-_*p 7\ a r»r* a a p p a p p p a p a TCATT CC TGGAGAGAAC CCT TT CAACAGCAGC 


2937 


bBULl : 




lllllllllllllllll MIMIIIIMIMMIIIIMI Mill Mill IMMI 

prrPT'w^ppapppAap^APAPPAPPPAPATPATTrPTrtGAGACAACCCCTTCAA^ 


2863 


QUERY : 


2938 


TTGGTTTCTCTC^TCCGAGGCCAAGTAGTAACTACAGATGGAACTCCCCTGGTCGGTGTG 


2997 


SBJCT : 


2864 


III! HIM IIIIIMMIIIMIIIIII 1 Mill MMM MM MUM 

CTGGTGTCTCTGATCCGAGGCCAAGTAGTAACCATGGATGGGACTCCCTTGGTGGGTGTG 


2923 


QUERY : 


2 998 AACGTGTCTTTTGTCAAGTACCCAAAATACGGCTAC^^ 

it t I > I i I 1 l I i 1 t 1 1 1 1 1 1 1 II 1 1 M 1 IIIIMMIMN! ! 1 M 1 1 1 1 1 1 1 1 1 I 1 


3057 


SBJCT : 


2924 


M i 1 f M 1 1 M I M II M 1 1 1 It M M 1 M M 1 M 1 II 1 II 1 1 I t t l l l l l l M M I 
AATGTGTCTTTTGTCT^GTACCGAAAATATGGCTACACCATCT^CTCGCC^ 2983 


QUERY: 3058 TTCGACCTGATCGCAAATGGAGGTGCTTCCTTGACTCTACACTTTGAGCGAGCCCCGTTC 3117
            || |||||||| || ||||| ||| || |||||||||| ||||||||||||||||| |||
SBJCT: 2984 TTTGACCTGATTGCCAATGGGGGTTCTGCCTTGACTCTTCACTTTGAGCGAGCCCCTTTC 3043


QUERY : 


3118 


ATGAGCCAGGAGCGCACTGTGTGGCTGCCGTGGAACAGCTTTTACGCCATGGACACCCTG 


3177 


SBJCT : 


3044 


MIMMMMMMM IMIMIMM 1 M 1 1 M II II II MMMMIMIMI 

ATGAGCCAGGAGCGCAGW^TGTGGCTGCCTV^ 


3103 


QUERY: 3178 GTGATGAAGACCGAGGAGAACTCCATCCCCAGCTGTGACCTCAGTGGCTTTGTCCGGCCT 3237
            || |||||||||||||| ||||||||||||||||||||||||||||||||||||||||| 
SBJCT: 3104 GTAATGAAGACCGAGGAAAACTCCATCCCCAGCTGTGACCTCAGTGGCTTTGTCCGGCCA 3163


QUERY: 3238 GATCCAATCATCATCTCCTCCCCACTGTCCACCTTCTTTAGTGCTGCCCCTGGGCAGAAT 3297
            |||||||||||||||||||| || |||||||||||||| || ||| ||||||    ||| 
SBJCT: 3164 GATCCAATCATCATCTCCTCTCCTCTGTCCACCTTCTTCAGCGCTTCCCCTGCCTCGAAC 3223


QUERY: 3298 CCGATCGTGCCTGAGACCCAGGTTCTTCATGAAGAAATCGAGCTCCCTGGTTCCAATGTG 3357
            || || |||||||||||||||||||||||||||||||| |||||||||||| ||||||||
SBJCT: 3224 CCCATTGTGCCTGAGACCCAGGTTCTTCATGAAGAAATTGAGCTCCCTGGTACCAATGTG 3283


QUERY : 


3358 


AAACTTCGCTATCTGAGCTCTAGAAL I GLACjGCj x ALAALr I LAL i LrL i 1 LALLA l vj 




SBJCT : 


3284 


II II II Mill II 1 1 1 1 II 1 II II 1 M II II Mill MIMMMIMMIMI 

AAGCTCCGTTATCTCAGCTCTAGAACTGCAGGGTATAAGTCGCTGCTGAAGATCACCATG 


3343 


QUERY : 


3418 


ACCCAGTCCACAGTGCCCCTGAACCTCATTAGGGTTCACCTGATGGTGGCTGTCGAGGGG 


3477 


bBJ LI : 


3344 


II MMIMMIIMM MINIM IMIIIMI IMIMI Mill IMMI 

ACGC7VGTCCACAGTGCCCTTGAACCTCATCAGGGTTCACTTGATGGTTGCTGTAGAGGGG 


3403 


QUERY: 3478 CATCTCTTCCAGAAGTCATTCCAGGCTTCTCCCAACCTGGCCTCCACCTTCATCTGGGAC 3537
             ||||||||||||||||||||||||||||||||||||| |||| ||| ||||||||||||
SBJCT: 3404 GATCTCTTCCAGAAGTCATTCCAGGCTTCTCCCAACCTAGCCTACACATTCATCTGGGAC 3463


QUERY: 3538 AAGACAGATGCGTATGGCCAAAGGGTGTATGGACTCTCAGATGCTGTTGTGTCTGTCGGG 3597
            ||||||||||| |||||||||||||| ||||| || || ||||||||||||||||| |||
SBJCT: 3464 AAGACAGATGCTTATGGCCAAAGGGTTTATGGCCTATCGGATGCTGTTGTGTCTGTTGGG 3523


QUERY: 3598 TTTGAATATGAGACCTGTCCCAGTCTAATTCTCTGGGAGAAAAGGACAGCCCTCCTTCAG 3657
            ||||||||||||||||| |||||||| || || |||||||||||||||||||| ||||||
SBJCT: 3524 TTTGAATATGAGACCTGCCCCAGTCTCATCCTGTGGGAGAAAAGGACAGCCCTGCTTCAG 3583


QUERY: 3658 GGATTCGAGCTGGACCCCTCCAACCTCGGTGGCTGGTCCCTAGACAAACACCACATCCTC 3717
            ||||||||||||||||| |||||||| || ||||||||||| |||||||||||||| ||||
SBJCT: 3584 GGATTCGAGCTGGACCCTTCCAACCTTGGAGGCTGGTCCCTGGACAAACACCACACCCTC 3643


QUERY: 


3718 


AATGTTAAAAGTGGAATCCTACACAAAGGL^CTGGGGAAAACC^GTTCCTGAL^ 


3777 


SBJCT : 


3644 


II 1 II HIM Mill IMIMIMM II HIM M 1 II M 1 1 M 1 1 1 II II 1 II 

AATGTGAAAAGCGGAATACTAGAGAAAGGGACAGGGGAGA^ 


3703
QUERY :


3778 


on T7'""T , - 

SBJCI : 


3704 


QUERY : 


3 838 


fin Tfirp 

bBJ CI : 


5 /o4 


ATTCDV . 

QUbRY : 


1 O Ct o 


CD TPT 1 . 




QUERY : 


3 958 


SBJCI : 


3884 


QUERY : 


4018 


SBJCI : 


3944 


ATTPtlV . 

QUciRY : 


A A H Q 

40 /o 


bBUCl : 




QUERY : 


413 8 


SB JCT : 


4064 


QUERY : 


4198 


on Torn 

SBiJCT : 


4124 


QUERY : 


4258 


CD TPT . 




AT TTT'TITT' 

QUERY : 


4318 


SBJCT : 


4244 


QUERY : 


4378 


O n mrri 

SBJCT : 


43 04 


QUERY : 


4438 


OD TOT . 


43 64 


QUERY : 


4498 


SBJCT : 


4424 


QUERY : 


4558 


CD TOT* . 


44 84 


QUERY : 


4618 


SBJCT : 


4544 




A C H O 

4o /o 




ACC\A 


QUERY : 


4738 


SBJCT : 


4664 


QUERY : 


4798 



IIIMIIIMMI 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 lllllllll I Mill llllllllllll 

CTGCCATCATCACGAGCATOVT^ 3 763 

3CAACGGCCTTGCTGAAGGCAACAAGCTGCTGGCCCCAGTGGCTCTGGCTGTTGGAATC 3 897 

INI llllllllllllltllllll 111 I Mill Mill IMIMII II III 

3CAATGGCCTTGCTGAAGGCAACAAACTGTTAGCCCCTGTGGCCCTGGCTGTGGGGATC 3823 
^TGGGAGCCTCTATGTGGGTGACTTCAATTACATCCGACGCATCTTTCCCTCTCGAAAT 3957 

10 ~ II MM III MM Ml lllllllllll II Mill IIIIIIIIIMIIIMMIM 

\TGGGAGCCTCTTTGTTGGTGACTTCAACTATATCCGGCGCATCTTTCCCTCTCGAAAT 3 883 
rGACC^GCATCTTGGAGTTACGAAATAAAGAGTTTAAAC^TAGCAACAACCCAGCACTVC 4017 

lllllll MMIIIIIIIIIMIIIIIIIIMIIIIIIIIIMIIII Mill MM 

rGACCAGTATCTTGGAGTTACGAAATAAAGAGTTTAAACATAGCAACAGCCCAGGACAC 3943 
^GTACTACTTGGCAGTGGACCCCGTGTCCGGCTCGCTCTACGTGTCCGACACCAACAGC 4077 

MMMMIMM 1 1 1 1 f 1 1 1 1 1 1 1 I Mill llllllll II IMIIIMIII 

^GTACTACTTGGCTGTGGACCCCGTGACTGGCTCACTCTACGTCTCTGACACCAACAGT 4 003 
3GAGAATCTACCGCGTCAAGTCTCTGAGTGGAACCAAAGACCTGGCTGGGAATTCGGAA 413 7 

I IIIIIIMM II II 1 1 1 1 1 1 1 II I Ml 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 IMIIMM 

3CCGAATCTACCGAGTCAAGTCTCTGAGCGGAGCCAAAGACCTGGCTGGAAATTCGGAA 4 063 
TTGTGGCAGGGACGGGAGAGCAGTGTCTACCCTTTGATGAAGCCCGCTGCGGGGATGGA 4197 

MIMIIMMII II II II IMIIIIIIIIIMIIMMMMM IMIIMM 

TTGTGGCAGGGACTGGCGAACAATGTCTACCCTTTGATGAAGCCCGCTGTGGGGATGGA 4123 
GGAAGG CCATAGATGCAAC CCTGATGAG CC CGAGAGGTATTG CAGTAGACAAGAATGGG 4257 

30 IIMIIII I II III II lllllll I II I llllll llllll IMMIII lllllll 

3GAAGGCTGTGGACGCCACCCTGATGAGCCCCAGAGGTATTGCAGTAGACAAGAATGGG 4183 
rCATGTACTTTGTCGATGCCACCATGATCCGGAAGGTTGACCAGAATGGAATCATCTCC 4317 

I MMMIIMI I i 1 1 1 11 i 1 11 1 1 1 M 1 1 M I M Mill M MMMMMM 

rTATGTACTTTGTTGATGCCACCATGATCCGGAAGGTGGACCAAAACGGAATCATCTCC 4243 
CCCTGCTGGGCTCCAATGACCTCACTGCCGTCCGGCCGCTGAGCTGTGATTCCAGCATG 4377 

MIMMMIIIIIMMIIIIIM II Mill II MMMIIMI 1 1 llllll 

CCCTGCTGGGCTCCAATGACCTCACAGGTGTCCGACCACTGAGCTGTGACTCGAGCATG 4303 
^TGTAGCCCAGGTTCGTCTGGAGTGGCCAACAGACCTTGCTGTCAATCCCATGGATAAC 443 7 

I M llllllll IMM II Mill IMIMII II Mill IIMIIII Ml 

ACGTGGCCCAGGTCCGTCTAGAATGGCCGACAGACCTCGCCGTCAACCCCATGGACAAC 4363 
C CTTGTAT GTT CTAGAGAACAATGT CAT C CTT CGAAT CAC CGAGAACCAC CAAGT CAGC 44 97 

II MM IMM IIMIIII IIMIIII M MM! Ml MM MUM 

CCCTGTACGTTCTGGAGAACAACGTG^TCCTGCGGATCA.CGGAGAACCACCAGGTCAGC 4423 
TCT^TTGCGGGACGCCCCATGCACTGCCAAGTTCCTGGCATTGACTACTCACTCAGCAAA 4557 

50 ~ Mill llllllll M IMIIIMIII Mill Mill IMIMII IMIIMM 

rCATCGCGGGACGGCCTATGCACTGCCAGGTTCCCGGCATCGACTACTCGCTCAGCAAA 4483 
rAGCCATTCACTCTGCCCTGGAGTCAGCCAGTGCCATTGCCATTTCTCACACTGGGGTC 4 617 

I Mill IMIMII Mill IMMIII IIIIIIIIIIIIIIMMIIIMIM 

rCGCCATCCACTCTGCGCTGGAATCAGCCAGCGCCATTGCCATTTCTCACACTGGGGTG 4 543 
T CTACAT CAC TGAGACAGATGAGAAGAAGATTAACCGT CTACG CCAGGTAACAAC CAAC 4677 

IMMMMMMMI II MMMIIMI Mill MIIIMI II II Mill 

rCTAC^TCACTGAGACGGACGAGAAGAAGATCmCCGCCTACGCCAAGTCACCACCAAT 4603 
3GGAGATCTGCCTTTTAGCTGGGGCAGCCTCGGACTGCGACTGCAAAAACGATGTCAAT 473 7 

I MMMIIMI Mill IMM Mill IMM I II II II II II I M 1 1 II 1 1 

3AGAGATCTGCCTCTTAGCCGGGGCGGCCTCAGACTGTGACTGCAAAAACGATGTCAAC 4663 
3CAACTGCTATTCAGGAGATGATGCCTACGCGACTGATGCCATCTTGAATTCCCCATCA 4797 

IM llllll II llllllll II Mill II M llllll MM II II II 

GCATCTGCTACTCGGGAGATGACGCTTACGCCACGGACGCCATCCTGAACTCGCCGTCC 4723 
CCTTAGCTGTAGCTCCAGATGGTACCATTTACATTGCAGACCTTGGAAATATTCGGATC 4857 

llllllll II II III Mill Mill III IMMMMMMI Mill MIMI
SBJCT: 4724 TCCTTAGCCGTGGCTCCGGATGGCACCATCTACATTGCAGACCTTGGGAATATCCGGATC 4783








A Q C O 


apppppptp app a AP A APAAPPPTPTTPTT A ATGPPTTCAACCAGTATGAGGCTGCATCC 

nuuUUub X bi^b7bi^/^biTi^b-T^iv_J V. X vJ X X \— X irifll UU^-. X X unnuv-nu xjttlx unwj v-. j. uv^rn v^-'v* 


4917 


Qp TPT * 


AH RA 


IIMIIIIMIIII II II II llllllll II IIIIIIIIIMIIIIIIIIIIII 

AnnaranTrAnrAAAAATAAArCCGTTCTTAACGCATTCAACCAGTATGAGGCTGCATCT 


4843 


HTTI7DV . 


A Q 1 Q 
*± y X O 


pp rr p a p a p P A nfj a PT T AT A TPTT TT PAAPG CTGATGG CAT C CAC CAATACAC TGTGAGC 


4977 


CD TOT 1 - 

SBJCT : 


4844 


II Mill Mill II M M IMIIIIIIMMI Mill It IIIIIMMII 

n r^r* r* r* a p a a p a p p a a ttct a pptpttp a A PPPTP A TP PT A T P P A T P APT A P APTGTGAGT 

b bbb bxA-bxAribAb bj-Hi-i. X IvJinUulul X brlf'_bv_tb lunlUwlril v_V-_-_x X i-_v_-_-_v_- ivjiunui 


4903 


QUERY : 


4978 


/~*rnr* r* tv a A"« 7\ o r"»p r* a P T 7\ at 1 TP T a P A A T TTP APATATAPTAPTPAPA ATPATPTPAPTPA A 


5037 


SBJCT : 


4904 


MMMM MIIMIIIIIMMIIIIIIIIMI II IMMIIMI Mill II 

r>fnp rppi 7\ r %r vc t en r* a r* t* a r ir r r rn , T a h a a tttp a c a t a r* A(*^ Pf^PTf^ACAATnAr'nTCACCGAG 


4963 


QUiiKY : 


ET At *3 O 


TTGATTGACAATAATGGGAATTCCCTGAAGATCCGTCGGGACAGCAGTGGCATGCCCCGT 


5097 


SBJCi : 


4964 


IMIMMIM II IMMMMII llllllll IMMMIMMMIMMIMI 

TTAaTTABAa A pa APPPPA ATTPPPTA A Af^ATPPr^CPPrnnAPAnP Af^THGCATGCCCCGC 


5023 


ATTDDV . 


boy o 


PAPP'FPP'T'PATPPPTY^APA APP AfttATP ATP APPPTPAPPGTGGnCACCAATGPrAGGCCTC 

IIIIMMIIIMI II II Mill MINIM II 1 II 1 II 1 1 1 1 II 1 1 1 1 II II 1 

p 7\ p ptp prpp A TA P PPP A T A A TP A ft A TT A T P A P P P TT A PTnTPrPrPrP A C C A ATGGAGGCCT C 


5157 


fin T A"*T> _ 

SBJCT : 


5024 


5083 


QUERY : 


5158 


7\ 7\ Tir*rTtr*r*TT*r*'Tir*r*T\ atvpaaa a pptpp 7\P PTTP-PTPTPTA TP. A PPT ATP,ATPP,P A A PAPTPtPt^ 


5217 


SBJCT : 


5084 


MM IIMMIII IMIIIIIIMMI M MMMM MMMM MMMM 

7\ 7\ 7iAPAP f ?A r rr i A7\ ptpapa 7iPPTPP APPTPPPPPTPATP,A PTTATHATPr^riA APAPTGnA 


5143 


QUERY : 


5218 


rHTiririrvrtr* rir+n PP7\ t\ AAAAA7\TA A A A PAPP ATPP APP-APTTTPTATP,APTATPtAPPAPPtA A 
(_ I CC 1 (j(jCCACCAALji\LrL-Lr/\l ortiiL^ x l X L- irtx o^^Xi^xo^^v_^^^j!T-t-v 


5277 


/-I'D TA^nP . 

SBJCT : 


5144 


Mill IIIIIMIIII llllllll llllllll Mill M M II 1 1 1 1 II 1 1 1 1 1 

ptppt apppa pp A AP APTP ATP A A APPrtP-ATHH AP A APTTTTTATGAPTATGACPAPGAG 


5203 


QUERY : 


5278 


paapp pptp A PP7\ a PP TP A PP PP PPPP A PP,nP.P,P,TPP,T A A PPAPTPTPrP A PPGGG A A ATG 


533 7 


SBJCT : 


5204 


Mill llllllll Mill IIMIIMIII Mill MIMMIIIMIIMMIM 

r*/~*r*r*i-*<-nr*rr<r<T\ r*r*T\ 7\rnr«rpy^t7\ ( -i^/-i/-ir i PPPA PPPPPPTPPTP APPAP.TPTPPAPPPP-PA A ATP- 


5263 


QUERY : 


5338 


/~t 7\ /^t 7\ 71 7\ rp^-rn7\ rprn7\ r*r* 7\ mTP A P A TTP A P A APTPPA A PPP TP ATP A TP. A PPTP A PTf^lTP A TP 


53 97 


SBJCT : 


5264 


1 1 f 1 1 1 1 1 1 1 1 IIIIIIIIIIIIIIIIIMIIMIM IIMIIIIMIMIIII Ml 

GAGAAATCTAT CACCA 1 I CjACA 1 1 (jAbAAL. 1 bLAAbbbbbA 1 1 bjA.L.L3- X i>cr X bH X V_ 


53 23 


QUERY : 


5398 


ACCAACCTCTCTTCAGTAGAGGCCl CC I ACACACr 1 bb i ACAAbjA 1 LAAb l x UbjbjAAUrt.\ju 


^.m 7 


SBJCT : 


5324 


IIIIIMMII II II IIIIIMMII MIMIMMMIMMIM II llllll 

a nM7\ a /^/^n^^^^i^ 1 Ti 1 ^f^A'' r Pr" , A" 1 7\pp AA^AATTV T'A PAPTPPTA PA AP A TP A AP-TP-PP A A AP AP-P 
ACCAACCTC I CC 1 CCG I t_rt_rAt_it_rCC I CC 1 A 1 AtJAbj I Gbj 1 AbAAbA 1 LMb x \j bbAHHbiib 


53 83 


ATTCDV . 

QUJaRx : 


a a c o 


TAPPAPP T PTP TA ATA A TP.P.T A HC'C'TCX A PPPTP A TPT A TP-PT A A TPPPr ATPPPT AT P AGC 


5517 


an TPT . 


C "3 Q A 


IMMMMII MMMM llllll IIMMIII II II II MM 1 Mill 

TAPPAPPTPTPPA ATAATPPAAPPPTPPPPPTPATPTAPPPPAAPGGPATGGPTGTPAGP 


5443 


ATTPDV . 

QUr_KY : 


obio 


TTPPAPAPPPAPPPPPATPTPPTAPPPPPPAPPATPAPPPPPAPPATTPPAPGPTGPAAP 
1 X b- b-r^.b,_-ib7 b.bijrT.v_rb- b- v-. b.i-_ ibib 1 . L i-i.br b-bJOO X \_^r-i\_. v_. \_. *i_.J-i^ "v^_-_ X i uunu-'v^ i uutinu 


5577 


SBJCI : 


5444 


llllllll llllllll Mill II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 II IMIIIMI 

mrpr^A 1 a r 1 a ptp A ppppp A PP TPPTPPP A PPP A PP A TP A PPPPP A PP ATPPPPPPPTPP A A P 
I 1 b,bJAb-Abi 1 bjAbjv_.b-Mb-Ab,br X Lb 1 v_bi \^£\\j\j b^H.b- b-i-i X b-i^b-b.b.v_.b-jf-ib-V_.i-iX x 0^2T_ri^_ 


5503 


QUERY : 


5578 


A TiriTTTTtrpA A ATTV n-i/~i/^l A A 1 A A rpAAAT"?^ A A PTPP A TTP A P TPPPPPPTA AP A A AP.P-A A PAP 

ATCTCCCIGCCx AiCbjAb/AAlbibjb- 1 lAAAbibbAl xbjAbjxbjbjb,bjb,b-xi^bj/i/^b^ 


D O J / 


SBJCT : 


5504 


Mill lllll IMIIIMMII 1 MMMM IMIMIMM II IMIIIMI 

A rnrtrnr*rnr*TVr*r t r i r*T\ T'A 1 PA PA ATPPPPTP A A PTPPATPPAP.TP.P-PPPPTP.APPA APPA APAP 
AxCiCxCi vjb,b-bJAl obfAbrAA X bjbjb-b- 1 uAAb 1 bbAl b,bt_-ibr XbJbjb-brb-b-X bri^V3bj/V\bibJi-irib._-i^ji 


5563 


QUJtiKx : 


a C Q Q 

boo o 


ATT A A APPPA A APTP A PPATPTTTPPP APPA APPTPPPPPTPP ATPP A AP A A ATPTPTTG 
All AAAbjbjb-Ar__-ib3 1 v*vriv_ v_jf-_ lbl I I bJbr b-i^brvj7i-ii-i.bi b- X b-bibiVr X v_ v-ori X \3\3r\±r^±^±\t\ x \- x v_ x x vj 


5697 


OD TPT . 


CT C C A 

bbo4 


II MIIIIIIIIIIIMMIMIM MMMM llllllll Mill llllll M 

A TP A A APPPA A APTP A PPATPTTTPPP APPA APPTTPPPPTPP APPPA APPA ATPTPPTP 
A 1 b^LAAbrbrb-AAAvj 1 b~r_v_ v->_ ibi 1 1 bibi bri-lbrbxHi-lo b. X X b-brbibi X L^i^v_jvj_-irtovJJ-iri l\-lV->_i.u 


5623 




ZZ C Q Q 

bb y o 


TPPATTP APT ATP ATPP A A AT ATT PPP APTP A A A APATPTATPATPAPPAPPPPA APTTP 
1 L.b-A 1 1 bxrlb. 1 1 bjA X b-brAAA ±r\ 1 1 b-o br_-_v_. X Oi-irt/iribJr., ibiril VxM X v_r_-_ v_ ^^-i*^ \_\_7 \j£-iJr±\3 ll_ 


5757 


Ot) TPT . 


bo Z^fc 


MIMIII Mill MMMM II II 11 MMMM MIMMMIIMI III 

TPPATTP ATTATPAPPPA A ATATPPPTAPPPAPA APATPTAPPATPAPPAPPPPA AATTP 
X bbAl 1 brll irii brH.bbbi.rir_rtl ril bbbl 1 AbbbAVjAnbr-i. b irtbUAluAuv_nv-V-wnr_ri J. x \_ 


5683 


QUERY: 5758 ACCCTGAGGATCATTTATGACCAGGTGGGCCGCCCCTTCCTCTGGCTGCCCAGCAGCGGG 5817
            |||||||||||||| |||||||||||||||||||||||||| ||||| || ||||| |||
SBJCT: 5684 ACCCTGAGGATCATCTATGACCAGGTGGGCCGCCCCTTCCTGTGGCTCCCGAGCAGTGGG 5743


QUERY : 


5818 


CTGGG^GCTGTCAACGTGTC^TACTTCTTCAATGGGCGCCTGGCTGGGCTTCAGCGTGGG 

MINIM Mill 1 1 II IMIIIIIIMMMIII MM II II Mill III 

CTGGC^GCCGTCAATGTCTCCTACTTCTTCAATGGGCGCTTGGCCGGCCTCCAGCGAGGG 


5877 


SBJCT : 


5744 


5803 








OO / o 


LTLLrt A L7.rt.L7 L.L7jri.L7.f"iL7 L7£^L~rtLT.i _ i.L~rA A ^vjriUfiflvjwTTi^jvj\,^\j | i-*fii ulj iui ■ v — x-.^ v^-^v _l \j j. j. > — * 1 — j- 


5937 


CD TP 1 ? * 


boU^fc 


MMMMMIIIIIlMliltl IIIIIIIIIIIIII IIIIIMIIIIMIIIIMI 

GCCATGAGCGAGAGGACAGACATTGACAAGCAAGGCCGGATCGTGTCCCGCATGTTCGCC 


5863 




J J JO 


PAPPPPAA APTPTPPAPPTAPTPPTAPPTTPAPAAGTPPATGGTCCTCCTGCTTCAGAGC 

L7rt.LL7L7L7.rtx^Lr"l.LT A L7 A UVJfivJV- J. iiL. X L L» X ^IL. L- X X v_TZT.L~f^rtLJ x v»» x vjvj j,vv-j,^\*iijvja -t * — nvnu n_ 

IMIIMIIII Mil! 1! INI! 1 1 1 1 1 1 1 1 1 M M 1 1 1 1 ! 1 1 1111! IIIIM 

PAPPPPAA APTPTPPAPTTATTPPTATPTTGAPAAGTPPATGGTCCTTCTGCTACAGAGC 


5997 


CD TPT • 
OIDU L X . 




5923 


OTTCD V . 

QUilKx : 


by ye 


r , ASPP r PPAPTA r rA r PA r P r PTPAPTA r T , f^APTPPTPTPTAPPnPPTPPTTGPPGTPAPPATGPPP 


6057 


oiiijLX : 




IIIIIMIIII M I'll MIMIIIMI M lllllll II Mill IIIIM 

PA a PPTP AP'PAPAT ATTT^a AT ATP, APT PPTPPPr ATPGPPTPPAPGPAGTPAPTATGPPP 


5983 




cr n c o 


APPPTnpPPPPT:PAPAP»PATPrTPPAPAPAPAPPTPPATPGGPTAPATPCGTAATATTTAC 


6117 


C D TPT * 

o-tsu l I : 


by 0*i 


II II 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 MIMIIIMI II llllllll II IIIIM 

a p,tp,t rcxcc ran c a p a a c a tgtp p a pgp apap pt c pat tggt tacat c CGAAACATT TAC 


6043 


ATTCDV . 


olio 


A ZiPPPPPPTna A aP.PA ATCPTTPP^PrTP ATPTTTPrAPTAPAGTGATG APGGPPGCATCCTG 


6177 


cd>tpt • 
oiiu l i : 


b ^± 


Mill II MMMIIMI IMIIIIIIIIMIMIIIIIIIIIIIIIIIIIMMI 

A APPPA P PPH A A A CiC A A TP P A TPftGTP A T PTTTG A PT A P AGTG ATG APGGCCGCAT C CTA 


6103 


QUJiKx : 


£TT 7Q 
D 1 / O 


A Ar2 A PPTPPTTTTTftlPPP A PPPP,APPyPP APPTGTTPTAP A AGTATGGGAAAPTPTCCAAG 


6237 


SBJL1 : 


6 11)4 


Mill II II llllllll II MMMMIIMIMIMIMM IIIIIMMIM 

7\ Annr»ATr i TTTPTTPPPPAPTPPPPf2PPAP.P,TP,TTPTAPA AftTATP,P,A A AAPTPTPPAAG 


6163 


LjU.t!i.K.x " 


a O T Q 


ttatpap APATTP.TPTA PPAP APT A PPPrPPHTP A PPTTPGGGT ATG APGAGAPPAPTGGT 


6297 


orSULi : 


blo4 


IIMIIIIIII IIIIIMIIII II MIMIIIMI IMIIIIIIIIMIMI III 

TTATPaPAPATAP,TPTAPPAPAPPAPAPPPGTPAPPTTTGGGTATGAPGAGAPPAPPGGT 


6223 


ATTPDV . 


bzyt) 


r 1r PP r rTP2i apa tpptp A a PPTPP A A APTPPPPPPTTPTPPTPP A PP ATP AGGTAPPGG A AG 


6357 


CD TOT . 

orJu lI : 


C O O A 


Ml MMMMIMM IMIIIIIIIMIIIIIIIMII IMMMMIMM III 

PTPPTP a APA TPPTP A A TPTPP A A APTPPtP^PtPtPTTPTPPTGT A PP ATP AGGTAPPGAAAG 


6283 


QUERY : 


6358 


7\mmpppft>-ip/-infi/-*pmj-ip7i ArT 1 5\r t S "POT* a r 1 AnnTTHTPPP APPA APPPA TPPTP A A TPPP 
AX 1 LjLjL.L-L.L-L. 1 LtLt 1 LrLjAL^fiALTL^fitj/i X L liiLilvjkJ X X L- X L-LLjALjV7J-)Lrt.Lr\jT^-rt.X X \-^t\t\X 


6417 


SBJCT : 


6284 


MM Mill MMMIMIMM IMIIMIIII IIIIIIM III MM III 

Lr 1 I LtLjLtLL.L.L. X XL? XLtLtALAALtVJALtAX X X J\KSA\jKj X lLlLi Lr/^LTVri J iALTLriA/-l. 1 LriAi L~rtrt.L.L?L-U 


6343 


QUERY : 


C A 1 Q 


7\ /-t/-irprprpr-i 7. r fr T7\ r*7\ fPTATTA TP 1 7A f~*7A A P Anr l T"T , PPPP ATPPP7A APP A TP A APPPPPTP ATA 


6477 


CD T/^T 1 . 




IIIIM II M 1 II 1 1 1 Mill MIMIIIMI II llllllll MUM! 

±\Kj\j X X XLrx\X iAirtLL Xi-il L-AL.LriAL.x-1-H.Xi-ivjv^ X ILLuLrtl iUL.UfiUV-fiiV^nrin^-^^uiUfiJ. X 


6403 




O *± / O 


APTPAPAPTPPPPTPPPPPTTPAPPTPTAPPGPTATGATGAGATTTPTGGPAAGGTGGAA 


653 7 




o4 U4 


II IMIMMIM M MINIMUM IIMMI MIMIII MIMMMIM 

APPPAP a PTPPPPTTPPTPTTPAPPTPT A PPPPT ATP APP APATTTPPGGP A AGGTGGAA 


6463 




C CT 1 Q 

b bo o 


PAPTTTPPTA APTTTPP APTP2XTPT ATT ATP AP ATP A APP APA TP ATP APP A PTPPPPTP 
Lr\L. 1 X X L7L7 Xi-i/\Lj X x xLrLri-lLTX Lrt.1 Lini X s\ X Lxrt.L-.rt. X Lririu K^±\\jt\ X LrlX LrlL-UrtL, XL7LLL7XL7 


6597 


CD TPT . 


C /i a A 


Mill II IMIMII llllllll II MIMMMMMIMMIIMIMIIM 

PAPTTPPPPA APTTTPPPPTP21TPTZ\PTAPPAPATPA APP APATPATPAPPAPTPPPGTP 
LjrtL. X 1 L.L7 Lt L./tx\l7 XII LtuLtl? X L-jHl i> L rt.L X rt.LL7rt.Lrt X Lr^rtLLrtvji-i.1 Lrl X LiAL LrtL x ltllvj X L 


6523 


ATTUDV . 


b by o 


ATPAPPPTPAPPA A APAPTTPP A PAPPPATPPPPPP ATP A APP APPTPPAPTATPAP ATP 
jft. 1 LtAll L 1 Lrt.Lr LirLrt/iLrtL X X LunLnLLLn X uuuuuun X LiT-rt.L7L7jrtL7Lr X L LriLj X r\ 1 LxrtLXrt X lt 


6657 


CD THT . 

orsu l l : 




Mill II Mill Mill IIIIIIIIIIIIII MMMM II II IIIIIIMI 

annnAPPPTTAPPA APP A PTTTP A PAPPPATPPPPPP ATP A APP A APTPP A AT ATP AGATG 

±\ X LxrtLLlL X X HuUfirt.L7LHL. X A X unLriLLLri X L7L7L7 LvjL-rt. X L-rii-iLrVji-lJ-iLT X UL-rlri iril L7-rtLjx-i X L7 


6583 


PTTTTDV - 


(Tf! C Q 
OOjo 


T T P C i Cl^Z r VC i (^^ , TC , A TPT A P TPP A TP A PPPTPP A A T A TP A P AP P A TPPP P APPGTG A T P A AP 
X X LLL7L7 A LLL 1 Lrt. X LT IAL X L7L7i-i. X LrriLL7Lj A LrLnrt Xrt.X uriUiluLl X L7L7LJ LriuuU x LJi-i. X L-ririLT 


6717 






MIIIIIIIIIIIMIIIMIMMI IMMIIMIMM Mill Mill IIIIM 

TTPPPPTPPPTP ATPT A PTPP ATP APTPTPP A ATATPAP APT ATPPPTAGGGTPATPAAP 

X X LLLjLt X LLL X LrtX L7 ArttL X L7LTrtt X LXrtL X LT A LJLrtrt. X jrt A L7/T.LiT.L7 lril Uuu XXA.L7LJLT X Crl X L-^lr-iu 


6643 


OTTT?DV - 


D / lO 


APPPAPPTA A A APTPPPPPPPTATPPPA AT APP APP A APT APAPPTATPAPTAPPATPPP 
i"iLf LfLri-iLT L XJrtjrtrtrtL X LtljLtliLLL xr\l ULL/UilriLLfiLLrnrtu AJ-iLrtLL A jM A L7i"iL A^-iLLr-riX L7LJL7 


6777 


CD TPT* - 
oi5u LI. 




Mill II Mill MMMIMIMM Mill IMMMMMMMM II III 

APPP A APTP A A APTAPPPPPPTATPPPA APAPP AP A A APTAP A PPT A TP APT ATP APPPP 
.rtLT L7L7jHLrt.L x urinrlL X ^iLTLTLrL L v L t\. X L7 L Lrtrt Lrt.L Lrt.Lfirtrt.L7 IriLfiLL A -rt A LTrt.\ — L rt. A L7rtLL7L7L7 


6703 




(T77P 


P A PPPPPAPPTPPAPAPPPTPPPPPTP A ATP APPPPPPPAPPTPPPPPTAPAPPT ATP A P 

ui^LLrLrLJ L-rtLr L A L Lrt.LriilL7 L L7 A L7L7LLL7 A L.rtrt. A L7rt.L LL7LLLL7rt.L L A LTLTLLtL X iAV — t-lLTL IrliwlL. 


6837 


CR.TPT • 


D / Uft 


Mill IMIMMIM IMIIMIIIIIIIIII II MMMIIMI IIIIIIMI 

P APPPPPAPPTPPAPAPTPTPPPPPTPA ATPAPPPPPPTAPPTPPPPPTATAPPTATPAP 
L7rt.LL7LrLLi-iL7L A LLrt.L7.rt.L7 A L7 X L7L7LLL7 x Lrt\rt. A LTr^LLLrLiLL ArtLL A L7L7LLTL A rt. A rt.L7 L A rt. A L7.rtL 


6763 




DO jo 


PTTA ATPPPA ATPTPPAPTTAPTPA APPPAPPPA APAPTPTPPPPPTPATPPPPTTPPPP 

L X A -rtrt A L7L7L7rtrt. A L X LLrtL X X J-iL X L7rt^lL L L jriLrLJ Lrtrt Lx^L7 A L7 A L7LL7LL A Lrt X L7LLL A X L7L.L7L 


6897 


SBJCT: 


6764 


II llllllll II Ml 1 II llllllll lllllll IMIIIIINIMI Ml 

CTCAATGGGAACCTGCACCTTCTAAACCCAGGAAACAGTGCTCGCCTCATGCCCTTACGC 


6823 


QUERY: 6898 TATGACCTCCGGGATCGGATAACCAGACTCGGGGATGTGCAGTACAAAATTGACGACGAT 6957
            ||||||||||| || ||||||||||| || ||||| |||||||||||||| || ||||||
SBJCT: 6824 TATGACCTCCGTGACCGGATAACCAGGCTAGGGGACGTGCAGTACAAAATCGATGACGAT 6883








oyDo 


p c a tptptp pp ap ap appgt ptg a c at pt tpgaatacaattc caagggcctcctaaca 


7017 


CD 1 r*"! 1 . 


O £5 0*± 


MMII MMMMM1MMI MMMM MMMM MMMMMI M M 

pppTATTTPTPppAPAPAnGPTPAGAPATPTTTGAATACJU^CTCCAAGGGCCTTCTGACG 


6943 


yUJiKx : 


/ U x.o 


AGAGCCTACAACAAGGCCAGCGGGTGGAGTGTCCAGTACCGCTATGATGGCGTAGGACGG 


7077 


oi5uk_ x . 




MMI MIMIMMMMMI MM! M MMMMMMM M 1 1 M M 

APAPPATAPAAPAAGGPPAGCGGATGGAGCGTGCAGTACCGCTATGACGGAGTGGGCCGC 


7003 




7fl 7 ft 


rGGGrTTrrTArAAGAPrAACCTGGGCCACCACCTGCAGTACTTCTACTCTGACCTCCAC 

MMMMMMMMMMMMMMMMMI MMMMMMM MMMMI 

rnr^PTTrrT A r A AG APPA ArCTGGGCCACCACCTACAGTACTTCTACTCCGACCTCCAC 


7137 




/ u U *± 


7063 


yUJiKx : 


HI O Q 
/ _L J O 


a a r p pp a r*r* r*p r* a tp a p rp a tgt tt a c a a t c a ptcp a ACTCGGAGATTAC C TCACTGTAC 


7197 


odJLI : 


*7 A £T /I 


MMI II II MMMMMI Mill Mlllllilll Mill Mill II III 

A APPPPAPAPPTATPAPPPATGTTTAPAAPPAPTPPAAPTPTGAGATCACCTCGCTCTAC 


7123 




/ iy o 


T7A r*P a PPTPPArzrzpPP A PPTPTTTPPPATPrH AG AGP AGP AGTGGGGAGGAGTACTATGTT 


7257 


CD TPT 1 . 


n 7/1 
/ X^^fc 


II 1 ! 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 IMIIIIIIMIIIIII Mill II II IIIMIII 

T A TP A PPTPPAPPPPP A PPT ATTTGPPATGG AG AGPAGTAGTGGTGAAGAATACTATGTC 


7183 






P P r*TPTrz A T A A P A P A f^riP A PT PP T P TGG PTGTGT T PAG PAT PAACGG C CTCATGATCAAA 


7317 


CD TPT » 


7 1 ft A 


Mill II Mill Mill IMIIIMIMM Ml Mill IIIIIIIMIIIII 

rrr'TPAGAPA ap a pgggg apppptptggptgtgtapagtatcaatggcctcatgatcaag 


7243 


fVTTCDV . 


/ Jio 


r i ar , PTPPAPTAPAPPPPPTATPGGGAGATTTATTATGAPTPPAAPPrPGACTTCCAGATG 


7377 


SBJCT : 


7244 


II Mlllllilll Illlllllllllll M IMMMIMI II MM Ml I I 

CAACTG CAGTACACAG C CTA T CjUGCjACjA 1 C i At- 1 A x OAL. 1 L.ui\/\ 1 U v^h.LxA.L- 1J.L \^PiSj±\ x vj 


7303 


QUERY: 7378 GTCATTGGCTTCCATGGGGGACTCTATGACCCCCTGACCAAGCTGGTCCACTTCACTCAG 7437
            |||||||||||||| || || |||||||||||||| ||||||||| |||||||| ||||| 
SBJCT: 7304 GTCATTGGCTTCCACGGAGGCCTCTATGACCCCCTCACCAAGCTCGTCCACTTTACTCAA 7363




7 /! 1 Q 
/ 4fc O O 


CGTGATTATGATGTGCTGGCAGGAiCC^TGGACCTCCCCAGACTATACCATGTGGAAAAAC 7497 

i t 1 t 1 t 1 I I I I 1 1 1 l 1 i 1 I 1 1 ! 1 1 1 lllll 1 t 1 1 1 MM! M M I 1 M t I t I t 


CD THT . 


7 "3 £T A 


MMMMMI M M M M M i M 1 MMI MMI Mill M 1 1 1 1 1 1 1 1 Ml 
CGTGATTATGACGTGCTGGCAGGi^GGTGGACGTCCCCCGACTACACCATGTGGAGGAAC 


7423 




/ *± J7 O 


GTGGGCAAGGAG C CGG C C C C CT TTAAQCTGTATATGTT CAAGAG CAACAAT C CT CT CAGC 


7557 


CD "TV^T" . 


/4z4 


MIMMMIMM MMMM MUllll llllllllll llllllllllll III 

GTGGGC^GGAGCC^GCCCCCTTC^CCTGTACATGTTO^GAACAAC^TCCTCTGAGC 


7483 


QUERY: 7558 AGTGAGCTAGATTTGAAGAACTACGTGACAGATGTGAAAAGCTGGCTTGTGATGTTTGGA 7617

            | |||||| || || ||||||||||||||||| ||||| |||||||||||||||||||||

SBJCT: 7484 AATGAGCTGGACTTAAAGAACTACGTGACAGACGTGAAGAGCTGGCTTGTGATGTTTGGA 7543




/ O X, O 


TTTCAGCTTAGCJ^CATCATTCCTGGCTTCCCGAGAGCCAAAATGTATTTCGTGCCTCCT 


7677 


Ox3U 1 . 




I1IIIM1 MIIIMMIIllllll MIIMIIIIIIIIIIIIM1II IIIMIII 

TTTCJ^GCTCAGCAACATCATTCCTGGATTCCCGAGAGCCAAAATGTATTTTGTGCCTCCC 


7603 




/ o / o 


CCCTATGAATTGTCAGAGAGTCAAGCAAGTGAGAATGGACAGCTCATTACAGGTGTCCAA 


7737 


CD TPT . 




IIIIIIIM IIIIIMIIMIIIIIIM Mill IMIMMMIIMMMMIM 

CCCTATGAACTGTCAGAGAGTCAAGCAAGCGAGAACGGACAGCTCATTACAGGTGTCCAG 


7663 




7 7 T Q 


CAGAC^^CAGAGAGACATAACCAGGCCTTCATGGCTCTGGAAGGACAGGT^ 


7797 


CD Tr^T . 




MINIM lllll MIMIMMMIM IMIMIMIIIIIIIIIIIII MMII 

CAGACAACTGAGAGGCATAACCAGGCCTT CCTGGCTCTGGAAGGACAGGTCAT CACTAAA 


7723 




7 7 Q £3 


AAGCTCCACGCCAGCATCCGAGAGAAAGC^GGTC^CTGGTTTGCCACCACCACGCCCATC 


7857 


oxt>u v— x . 


110 A 


MMMM IMMMMIMMMMIMM MMMMMI MMMM MMII 

AAGCTCCATGCCAGCATCCGAGAGAAAGCAGGCCACTGGTTTGCTACCACCACACCCATC 


7783 




7ft ^ft 


ATTGGCAAAGGCATCATGTTTGCCATCAAAGAAGGGCGGGTGACCACGGGCGTGTCC^ 


7917 


SBJCT : 


7784 


II MMIMMMMMMIMMMIMMMMMMMIMM II Mill Ml 

ATCGGCAAAGGCATCATGTTTGCCATCAAAGAAGGGCGGGTGACCACAGGAGTGTCTAGC 


7843 


QUERY: 


7918 


ATCGCCAGCGAAGATAGCCGC^GGTGGCATCTGTGCTGAACAACGCCTACTACCTGGAC 


7977 


SBJCT : 


7844 


IIIMIII M 1 1 MMMMMI Mill Ml IMMII IIIMIIM 1 Ml 

ATCGCCAGTGAGGACAGCCGCAAGGTAGCATCCGTGTTGAACAATGCCTACTACTTAGAC 


7903 






QUERY : 


7978 


AAGATGCACTACAGCATCGAGGGCAAGGAC^CLCAC 1 AL 111 Lx KaAAbiA 1 1 WaL. i UALtLL. 


0 \J 3 ! 


SB JCT : 


7904 


MllllllMIIIMilllllllMIIIIIII 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 III 1 II 

71 ti 7\ rrt/^t^Tv nrttn ^71 /~i/^i7\ T>00 A OOO OA AOO A OA OA OA OH" 1 A prrrTiiTinmp a AO A TOOO P/iPPnPn 

AAGATGCACTACAGCATCGAGGGLAAGGACAuAuAu 1 AL i 1 ib 1 LjAALtA l ultLtUU'L lv^laj 




QUERY: 


8038 


GA 1 GGLGALC 1 G(j 1 UAxJAL 1 UAU L. A L. LA x v-buLLuUrtrt\jb i at 1 J^^JtUjj\kj\^Kj\jkj\j x vj 


8097 


SB JCT : 


7964 


Mill lllllllllll Mill llllllll II IIIIIMIMI Mill MUM 

n a rvnnrvn a. nn'vnn nnn a pn o^a. po a a a CHA TTnrin.PnPAArtnTnnTnfiAnAnTftnnfTrc^ 


8023 


ATT 1 \ I i T T 

QUERY : 


8098 


7\ ti oonno a i^pp , T i p , T , ppp7\pppp7\rpp r T i PPTfiOTrAZirricr , i\(7(^z\rTrnz\ Af^TTPAfY^AAC 1 


8157 


SB JCT : 


8024 


iiiiiiii inn imiimmiim n imimimiimm m 

a a nnmm\rtpr«Tir<T>r<7\ P7\PPPP7\ /-t/^/^rpz-i r ir Pnr ,r Pr , Ts. A 'Vnn m^r^CtA PTPP.a AP.nTT'P A PPA A P 
AACG 1 GAGGG 1 G 1 CJACAoCLLaLLtL. 1 1 tjvj 1 LrAA x x Loi-irtLrVj 1 x Uiiu^rtrtu 


8083 


QUERY: 


8158 


ATTGAGTTCCAGTACTCCACGCTGL 1GC xCA.GL.Al LLajL. IAILtLtLL 1 i^LL-LLLLxALALL 


Oil / 


SB JCT : 


8084 


mmmmmmiimmiiiiii iiiiiiii mmmmiim 

ATTGAGTTCCAGTACTCCACGCTGCTGCTCAGTATCCGCTACGGCCTCACCCCCGACACG 




QUERY : 


8218 


CTGGACGAAGAGAAGGCCCGCGTCCTGGACCAGLtC(jACjA 1 LtLtLtLAL-Lt^ll. 

minimi Miiiimiiimmii m limn iiiiiiii 11 m 

CTGGACGAAGAAAAGGCCCGCGTCCTGGACCAAGCGGGACAGAGAGCCCTGGGTACTGCC 


ft 9 7*7 


SB JCT : 


8144 


8203 


QUERY : 


8278 


mooo/-»oA aooaooaooaoa a ^ppp^pppAPPriPTiPSPfiPPPniinpp^iPPTnTnnAPTCAn 
TGGGCLAAGGAGGALtLIALxAAAL^CL^oo^ x o- x x oi-io 

iimmiiimimiimmmiimiiimi iimimiim in 

TGGGCCAAGGAGCAGCAGAAAGGLAGGGAUvjU^ ibi LtLjAHjIjAo 


8337 


SB JCT : 


8204 


OZOJ 


QUERY : 


8338 


GG CGAGAAG CaGCAGCTTCTGAGCACCGGG CGCGTGUAACjCjVj 1 AC(jALiLj(jA 1 A 1 1 PiSJj 1 <a 


0 J J / 


SB JCT : 


8264 


mmiiiiim ii iiiiiiii ii ii ii inn ii inn iiiiiiii 

GGCGAGAAGCAGCAACTCCTGAGCACGGGACGGGTACAAGGTTATGAGGGCTATTACGTA 




QUERY : 


8398 


CTTCCCGTGGAGCAATACCCAGAGCTTGCAGACAGTAGCAGCAAL^ 111 IAALjA 

mil urn 11 11111 11111 mimimiimiimiim mm 

CTTCCGGTGGaACAGTACCCGGAGCTGGCAGACAGTAGCAGC^CATCCAGTTCTTAAGA 


CIA c;7 


SB JCT : 


8324 


O j O j 


QUERY: 


8458 


ninii m/-i 71 /-) 71 mnn/TA 71 7\ 0 a /~t/™"T> a A OA AAA t 1 A A mpirppTP OO A '["MPP'^'CPT^PTP A A r T x n.nf^ r P 

CAGaATGAGATGGGAAAGAGGTAACAAAA 1 AA 1 L. 1 CjL. 1 CjLUA 1 lL,uliLrlu.l LrAA I tjvjrL. 1 

1 1 ! 1 1 1 1 1 1 1 i ! 1 1 ! 1 1 i 1 1 1 i 1 ! 1 ! 1 1 1 1 1 1 1 1 1 i 1 i 1 1 1 III MM Mill 

CAGAATGAGATGGGAAAGAGGTAACAAaATAACCTGCTGCCACCTCTTCTCTGGGTGGCT 




SB JCT : 


8384 


8443 


QUERY : 


8518 


CAGCAGGAGTAaCTGTTATCTCCTCTCC rAAGGAGAlGAALrAL-L 1 AAUAvjULjLtLAL. 1vjL-vj 

MIMIMI MUM 1 IMIMIMIIIIIIII MIIIIMMI III MM 1 

--j /-h »-~» ti y-t>-t7i t\ rtrn/^mn t\ nnmnnmnmnnrnT\ A OO A 0 A OO A A OA OOT 1 A AO f~^(~* r^C* O A OT^O A O 

CAG CAGGAGCAACTGTGACCTC CT CT C C TAAGGAUAGGAACjAU L 1 AAL. - UAL. 1 vjAvj 


oj / / 


SB JCT : 


8444 


OCAO 

0 jUZ 


QUERY : 


8578 


GCTGGGCTGCTTTAGGAGACCAAGTGGCAAGAAAGCTCACATi ill iLrALrl luAAAivaLi 


ft C "5 7 


SB JCT : 


8503 


II IMII1MIIIMI IMMMMIMIIMMIMMMIIIIMMIIIIIIII 

GCCGGGCTGCTTTAGGATCCCAAGTGGCAAGAAAGCTCAC^TTTTTTGAGTTCaAATGCT 


8562 


QUERY: 8638 ACTGTCCAAGCGAGAAGTCCCTCATCCTGAAGTAGACTAAAGCCCGGC 8685

            |||||| |||||  ||||||||||||||||||||||||| ||||||||

SBJCT: 8563 ACTGTCTAAGCGCAAAGTCCCTCATCCTGAAGTAGACTAGAGCCCGGC 8610





SCORE = 1570 BITS (792), EXPECT = 0.0 
IDENTITIES = 1095/1196 (91%) 

STRAND = PLUS / PLUS 



QUERY : 


270 


SB JCT : 


103 


QUERY: 


330 


SB JCT : 


163 


QUERY : 


390 


SB JCT : 


223 


QUERY : 


450 


SB JCT : 


283 


QUERY : 


510 



iiiiiiimiiiiiiiiiiMmiiii MMiiiimiM inn mil 

'GGAATAATGGATGTAAAGGACCGGCGACATCGCTCTTTGACCAGGGGACGGTGTGG 
lGAGTGTCGCTACACAAGCTCCTCTCTGGACAGTGAGGACTGCCGGGTGCCCACACA 

MMMMIIM: 11111111111111111111111111111 IIIIIIII II 

^GAGTGTCGCTACACCAGCTCCTCTCTGGACAGTGAGGACTGCCGTGTGCCCACTCA 
^TCCTACAGCTCCAGTGAGACTCTGAAGGCCTATGACCATGACAGCAGGATGCACTA 

iiiiiiii iiiiiiiiiii limn iMiiiiiiiiiiiiii mimi 

jTCCTACAGTTCCAGTGAGACCTTGAAGGCTTATGACCATGACAGCAGAATGCACTA 
AACCGAGTCACAGACCTCATCCACCGGGAGTCAGATGAGTTTCCTAGACAAGGAAC 

iimmiiiMim i milium iiimm iiiimm n 



70 III II Mill I II III MM Mill IMIMIMIIIIIIMIM II Mill 




SBJCT: 343 AAACTTCACCCTGGCAGAATTGGGAATCTGCGAGCCCTCCCCACACCGAAGTGGTTACTG 402 
HCGACATGGGGATCCTTCACCAGGGCTACTCCCTTAGCACAGGGTCTGACGCCGACTG 629 

IIIMIIMI 1 1 1 1 ( IlilllllllilMMI Mill llllllll II Mill 

^CGACATGGGTATCCTCCACCAGGGCTACTCCCTGAGCACTGGGTCTGATGCAGACTC 462 
^CACCGAGGGAGGGATGTCTGCAGAACACGCCATCAGACTGTGGGGCAGAGGGATAAA 689 

MIMMMMMMMMIMIMMI 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I M II 1 1 1 II 

^CACCGAGGGAGGGATGTCTCCAGAACATGCCATCAGACTGTGGGGACGAGGGATAAA 522 
^CAGGCGCAGTTCCGGCCTGTCCAGTCGTGAAAACTCGGCCCTTACCCTGACTGACTC 74 9 

1 1 1 1 ! 1 1 1 1 1 ii iii mm! ii 1 1 iiiiiMiiiMii milium 

CCAGGCGCAGCTCTGGCTTGTCCAGCCGCGAGAACTCGGCCCTTACTCTGACTGACTC 582 
ACAACGAAAACAAATCAGATGATGAGAACGGTCGTCCCATTCCACCTACATCCTCGCC 809 

mi inn nm inn m n mmimmmmmmii i 

&CAATGAAAATAAATCGGATGACGACAATGGT CGTCC CATT CCACCTACAT CC T CGTC 642 
GTCTCCTCCCATCTGCTCAGCTGCCTAGCTCCCATAATCCTCCACCAGTTAGCTGCCA 869 

20 Ml | M || 1 1| | M I M M M M 1 1 1 1 M 1 1 II I II 1 1 II M II I II 1 1 II II II II II 

^GCCTCCTCCCATCTGCTCAGCTGCCTAGCTCCCATAATCCTCCACCAGTTAGCTGCCA 702 
^TGCCATTGCTAGACAGCAACACCTCCCATCAAATCATGGACACCAACCCTGATGAGGA 929 

MIIIMMIMMMIIIIMMMIIIIM I II I II II II 1 1 II II I M I M I M I 

^TGCCATTGCTAGACAGCAACACCTCCCATCAGATCATGGACACCAACCCTGATGAGGA 762 
rTCTCCCCCAATTCATACCTGCTCAGAGCATGCTCAGGGCCCCAGCAAGCCTCCAGCAG 989 

I II 1 1 1 M 1 1 1 1 1 1 II II II 1 1 M 1 1 II M 1 1 1 ! I II I II II II II M 1 1 1 M II I! M 

r TCTCCCCCAATTCATACCTGCTCAGAGCATGCTCAGGGCCCCAGCAAGCCTCCAGCAG 822 
3GCCCTCCGAACCACCACAGCCAGTCGACTCTGAGGCCCCCTCTCCCACCCCCTCACAA 104 

1 1 II M M MIMIMMMMIM M MMIIIIIIIIM MMIIIMM II 

GGCCCTCCAAACCACCACAGCCAGTCAACACTGAGGCCCCCTCTGCCACCCCCTCATAA 882 
CACACGCTGTCCCATCACCACTCGTCCGCCAACTCCCTCAACAGGAACTCACTGACCAA 110 

nm nmm mimi ii miiiimimiiimmimmii 

CACACCCTGTCCCACCACCACTCCTCGGCCAACTCCCTCAACAGGAACTCACTGACCAA 942 
CGGCGGAGTCAGATCCACGCCCCGGCCCCAGCGCCCAATGACCTGGCCACCACACCAGA 116 

40 Mill Ml I III MMIIIMM II II MM! I II II I III MM MM Mill 

5GCGGAGTCAAATCCACGCCCCAGCTCCTGCGCCCAACGACCTGGCCACCACCCCAGA 100 
XGTTCAGCTTCAGGACAGCTGGGTGCTAAACAGCAACGTGCCACTGGAGACCCGGCA 12 2 

I MIMIII Mill IMIIMMM Mill Mill IMIIIMII! Mill 

:TGTTCAGCTCCAGGATAGCTGGGTGCTGAACAGTAACGTCCCACTGGAGACTCGGCA 106 
rCCTCTTCAAGACCTCCTCGGGGAGCACACCCTTGTTCAGCAGCTCTTCCCCGGGATA 12 8 

1 1 ii nm ii ii ii ii imimi 1 1 1 1 1 1 i 1 1 1 1 1 ii 1 1 iiniiii 

rCCTTTTCAAAACGTCGTCTGGAAGCACACCCCTGTTCAGCAGCTCTTCTCCGGGATA 112 
HTTTGACCTCAGGAACGGTTTACACGCCCCCGCCCCGCCTGCTGCCCAGGAATACTTT 134 

mimiiiiii ii nm ii ii n um mini mini n 

-TTTGACCTCAGGGACCGTTTATACACCACCACCCCGCCTGCTGCCACGGAATACATT 118 
^CAGGAAGGCTTTCAAGCTGAAGAAGCCCTCCAAATACTGCAGCTGGAAATGTGCTGC 140 

Ilimilll ill!! 1 1 1 1 1 M 1 1 1 ! M M I ! llllllllllllll 

HCAGGAAGGCCTTCAAGCTGAAGAAACCCTCCAAATACTGCAGTTGGAAATGTGCTGC 124 
rCTCCGCCATTGCCGCGGCCCTCCTCTTGGCTATTTTGCTGGCGTATTTCATAG 1465 

60 Ml || HIM HIM Mill 1 1 II III II III II III III 1 1 Mil I III 

rGTCTGCCATCGCCGCCGCCCTCCTCTTGGCCATTTTGCTGGCATATTTCATAG 1298 

BITS (734), EXPECT = 0.0 
IDENTITIES = 1000/1088 (91%), GAPS = 3/1088 (0%) 
STRAND = PLUS / PLUS 






CO T PT 1 . 




OTTTDV . 

QuhiKi ; 


570 


on i r^r 1 . 
bbJCJ, : 


A C\ 1 


{-\T TTTl TJ V" . 




SBuLl : 




QUERY : 


690 


SBJC1 : 


COT 


/"VT TT71 TTV7" - 

QUERY : 


750 


on T/" lr P » 

SBJCT : 




QUhRY : 


810 


boo LI : 


C A 1 


ATT 1 .1 1 - 

QUERY : 


870 


SBJCT : 


703 


r>T7nn XT' 

QUERY : 


93 0 


SBJC1 : 


/oi 


QUERY : 


990 


SBJCT : 


823 


QUERY : 


1050 


SBJCT : 


883 


QUERY : 


1110 


SBJCT : 


943 


QUERY : 


1170 






QUERY : 


123 0 


^>t*% ~r/— i m 

SBJCT : 


1063 


QUERY : 


1290 


C 1 "D TC^T 1 . 

SBJCi : 


1123 


QUERY : 


1350 


SBJCT : 


1183 


QUERY : 


1410 


SBJCT : 


1243 


SCORE 


= 14 






QUERY: 1464 AGTGCCCTGGTCGTTGAAAAACAGCAGCATAGACAGTGGTGAAGCAGAAGTTGGTCGGCG 1523 

llllllllllll IIIIIIIIIMMIIMIIIIIIIII IIIIIIMIIIIMIIIIII 

SBJCT: 1420 AGTGCCCTGGTCATTGAAAAACAGCAGCATAGACAGTGGCGAAGCAGAAGTTGGTCGGCG 1479 



QUERY: 1524 GGTAACACAAGAAGTCCCACCAGGGGTGTTTTGGAGGTCACAAATTCACATCAGTCAGCC 1583 

Ml Mill IMMIMMIIIIIMIMIMMMM II M 1 1 1 ! 1 1 1 1 1 M 1 1 1! 

SBJCT: 1480 GGTGACACAGGAAGTCCCACCAGGGGTGTTTTGGAGGTCCCAGATTCACATCAGTCAGCC 1539 
QUERY: 1584 CCAGTTCTTAAAGTTCAACATCTCCCTCGGGAAGGACGCTCTCTTTGGTGTTTACATAAG 1643 

It 1 1 i 1 1 1 1 1 1 1 M 1 E 1 1 1 M 1 1 1 1 II HIM II Mill Mill II Mill 

SBJCT: 1540 TCAATTCTTAAAGTTCAACATCTCCCTGGGCAAGGATGCCCTCTTCGGTGTCTATATAAG 1599 
QUERY: 1644 AAGAGGACTTCCACCATCTCATGCCCAGTATGACTTCATGGAACGTCTGGACGGGAAGGA 1703 

10 ' 1 1 1 M I II II M I II 1 1 1 1 II I M M M I M M I II II M 1 1 II 1 1 1 M I II II 

SBJCT: 1600 GAGAGGACTACCACCGTCTCATGCCCAGTATGACTTCATGGAACGCCTGGATGGAAAGGA 1659 
QUERY: 1704 GAAGTGGAGTGTGGTTGAGTCTCCCAGGGAACGCCGGAGCATACAGACCTTGGTTCAGAA 1763 

Ml Mill Mill II I II II M M M 1 1 II M M II II HIM MM Mill 

SBJCT: 1660 GAAATGGAGCGTGGTCGAGTCGCCCAGGGAACGCCGGAGCATCCAGACTCTGGTGCAGAA 1719 
QUERY: 1764 TGAAGCCGTGTTTGTGCAGTACCTGGATGTGGGCCTGTGGCATCTGGCCTTCTACAATGA 1823 

II II IMIMIIIIIMII MMIIIIIIIIMMIII I f I i 1 1 1 i 1 1 1 1 1 1 1 1 1 

SBJCT: 1720 CGAGGCTGTGTTTGTGCAGTACTTGGATGTGGGCCTGTGGCACCTGGCCTTCTACAATGA 1779 






QUERY: 1824 TGGAAAAGACAAAGAGATGGTTTCCTTCAATACTGTTGTCCTAGATTCAGTGCAGGACTG 1883 

II M II 11 1 IIIIMM MMMM IMIIMM I II I II M M II M M II I 

SBJCT: 1780 CGGGAAGGACAAGGAGATGGTCTCCTTCAACACTGTTGTCTTAGATTCAGTGCAGGACTG 1839 



QUERY: 1884 TCCACGTAACTGCCATGGGAATGGTGAATGTGTGTCCGGGGTGTGTCACTGTTTCCCAGG 1943 

I II I M Mill M HIM IIIIMM Mill M ) II II ! I II H 1 1 M II ! I 

SBJCT: 1840 TCCACGGAACTGTCACGGGAACGGTGAATGCGTGTCTGGACTGTGTCACTGTTTCCCAGG 1899 
QUERY: 1944 ATTTCTAGGAGCAGACTGTGCTAAAGCTGCCTGCCCTGTCCTGTGCAGTGGGAATGGACA 2003 

30 Ml II II I MMIMIIMIMIIMIIIIIIIIMI MMMM II MMMM 

SBJCT: 1900 ATTCCTAGGTGCAGACTGTGCTAAAGCTGCCTGCCCTGTACTGTGCAGCGGAAATGGACA 1959 
QUERY: 2004 ATATTCTAAAGGGACGTGCCAGTGCTACAGCGGCTGGAAAGGTGCAGAGTGCGACGTGCC 2063 

I II II II M II i 1 1 1 1 i M M H M 1 1 M 1 1 1 1 1 1 M M M E 1 1 1 1 1 1 1 M HIM 

SBJCT: 1960 GTATTCTAAAGGAACGTGCCAGTGCTACAGCGGCTGGAAAGGTGCAGAGTGTGATGTGCC 2019 
QUERY: 2064 CATGAATCAGTGCATCGATCCTTCCTGCGGGGGCCACGGCTCCTGCATTGATGGGAACTG 2123 

Mill M II IMMMIIMIM IIIIMM II M M M II 1 1 II II II I II II 

SBJCT: 2020 TATGAACCAATGTATCGATCCTTCCTGTGGGGGCCATGGCTCCTGCATTGATGGGAACTG 2079 



QUERY: 2124 TGTCTGCTCTGCTGGCTACAAAGGCGAGCACTGTGAGGAAGTTGATTGCTTGGATCCCAC 2183 

M M 11 1 1 1 1 1 1 1 11 1 1 IMMIMMMMIMMIMMMMMIMM II 

SBJCT: 2080 CGTGTGTGCTGCTGGCTACAAGGGCGAGCACTGTGAGGAAGTTGATTGCTTGGATCCTAC 2139 



QUERY: 2184 CTGCTCCAGCCACGGAGTCTGTGTGAATGGAGAATGCCTGTGCAGCCCTGGCTGGGGTGG 2243 

MMMIIMM II I M M M II II 1 1 1 II I II II IIIIMM IMIMIMM 

SBJCT: 2140 CTGCTCCAGCCATGGTGTCTGTGTGAATGGAGAGTGTCTATGCAGCCCCGGCTGGGGTGG 2199 
QUERY: 2244 TCTGAACTGTGAGCTGGCGAGGGTCCAGTGCCCAGACCAGTGCAGTGGGCATGGCACGTA 2303 

50 Ml Ml MM MM II I MM MM II MM I HUM 1 11 II HIM MM III II 

SBJCT: 2200 TCTCAACTGTGAGCTGGCGAGGGTCCAGTGCCCAGACCAGTGTAGTGGGCATGGCACTTA 2259 
QUERY: 2304 CCTGCCTGACACGGGCCTCTGCAGCTGCGATCCCAACTGGATGGGTCCCGACTGCTCTGT 2363 

Ml MUM I IMMMMIMM HIM II II M II 11 1 1 1 II M 1 1 1 II II II 

SBJCT: 2260 CCTCCCTGACTCCGGCCTCTGCAGCTGTGATCCGAACTGGATGGGTCCCGACTGCTCTGT 2319 

QUERY: 2364 TGAAGTGTGCTCAGTAGACTGTGGCACTCACGGCGTCTGCATCGGGGGAGCCTGCCGCTG 2423 

1 MMMMMMIMMMMMMMMMIMMMMMMMMIMIMM 

SBJCT: 2320 T---GTGTGCTCAGTAGACTGTGGCACTCACGGCGTCTGCATCGGGGGAGCCTGCCGCTG 2376 



QUERY: 2424 TGAAGAGGGCTGGACAGGCGCAGCGTGTGACCAGCGCGTGTGCCACCCCCGCTGCATTGA 2483 

I II II II M II 11 1 II II I II II I MMMMMMMMMMMMMMMIMM 

SBJCT: 2377 TGAAGAGGGCTGGACAGGCGCAGCTTGTGACCAGCGCGTGTGCCACCCCCGCTGCATTGA 2436 



QUERY: 2484 GCACGGGACCTGTAAAGATGGCAAATGTGAATGCCGAGAGGGCTGGAATGGTGAACACTG 2543 

MMMMMMMMM IIIIMM III MM MM Ml II II MM MMMIIMM 

SBJCT: 2437 GCACGGGACCTGTAAAGATGGCAAATGTGAATGCCGAGAGGGCTGGAATGGTGAACACTG 2496 



QUERY: 2544 CACCATTG 2551 

            ||||||||

SBJCT: 2497 CACCATTG 2504 
SCORE = 105 BITS (53), EXPECT = 5E-19 
IDENTITIES = 81/89 (91%), GAPS = 1/89 (1%) 
STRAND = PLUS / PLUS 


QUERY : 8711 AACGAATGAATGAACAGACACACACAATGTTCCAAGTTCCCCTAAAATATGACCC^ 8770 

MINI MINI! I HUM! Ml 1 1 1 1 M 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 ! 1 1 1 1 1 

SBJCT: 8655 AACGAACGAATGAAAACAC^CACAAAATGTTTCAAGTT 8714 
QUERY: 8771 TTCTGGGTCT-ACGCAGAAAAGAGACGCA 8798 

            ||| |||||| | ||||||||||||||||

SBJCT: 8715 TTCCGGGTCTAAGGCAGAAAAGAGACGCA 8743 

SCORE = 48.1 BITS (24), EXPECT = 0.093 
IDENTITIES = 30/32 (93%) 

STRAND = PLUS / PLUS 

QUERY: 475 CACCGGGAGTCAGATGAGTTTCCTAGACAAGG 506 

           ||||||||||| ||||||||| ||||||||||

SBJCT: 7   CACCGGGAGTCCGATGAGTTTTCTAGACAAGG 38 

In this search it was also found that the FCTR3b, c, d, and e nucleic acids had homology to 
three fragments of Rattus norvegicus neurestin alpha. They have 5498 of 6132 bases (89%) identical 
to bases 2527-8658, 1081 of 1196 bases (90%) identical to bases 123-1318, and 996 of 1088 bases 
(91%) identical to bases 1440-2527 of Rattus norvegicus neurestin alpha (GenBank 
Acc: NM_020088.1) (Table 3N). 
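
The hit statistics quoted above can be cross-checked with a short calculation. The following is a minimal sketch, not part of the original search: the function and variable names are illustrative, and the percentages are assumed to be truncated to whole numbers, which is consistent with the values reported in the alignment headers of this document (for example, 30/32 reported as 93%).

```python
# Hit statistics quoted in the paragraph above:
# (identities, aligned bases, first subject base, last subject base)
HITS = [
    (5498, 6132, 2527, 8658),
    (1081, 1196, 123, 1318),
    (996, 1088, 1440, 2527),
]


def span_length(first, last):
    """Number of bases in an inclusive coordinate range."""
    return last - first + 1


def percent_identity(identities, aligned):
    """Whole-number percent identity, truncated (assumed reporting convention)."""
    return (100 * identities) // aligned


def summarize(hits):
    """Return (subject span length, percent identity) for each hit."""
    return [(span_length(f, l), percent_identity(i, n)) for i, n, f, l in hits]


if __name__ == "__main__":
    for (i, n, f, l), (span, pct) in zip(HITS, summarize(HITS)):
        print(f"{i}/{n} bases ({pct}%) over subject bases {f}-{l} (span {span})")
```

For each of the three fragments, the subject coordinate span equals the number of aligned bases, and the computed percentage agrees with the figure stated in the text.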

Table 3N. BLASTN of FCTR3b, c, d, and e against Rattus norvegicus Neurestin alpha mRNA (SEQ ID NO:66) 

>GI|9910319|REF|NM_020088.1| RATTUS NORVEGICUS NEURESTIN ALPHA (LOC56762), MRNA 
LENGTH = 8689 

SCORE = 7129 BITS (3596), EXPECT = 0.0 
IDENTITIES = 5498/6132 (89%) 
STRAND = PLUS / PLUS 




QUERY: 2578 GATGGCTGCCCTGACTTGTGCAACGGTAACGGGAGATGCACACTGGGTCAGAACAGCTGG 2637

            |||||||||||||| |||||||||||||||||||||||||||||||||||||||||||||

SBJCT: 2527 GATGGCTGCCCTGATTTGTGCAACGGTAACGGGAGATGCACACTGGGTCAGAACAGCTGG 2586

QUERY: 2638 CAGTGTGTCTGCCAGACCGGCTGGAGAGGGCCCGGATGCAACGTTGCCATGGAAACTTCC 2697

            |||||||||||||||||||||||||||||||||||||||||||||||||||||||| |||

SBJCT: 2587 CAGTGTGTCTGCCAGACCGGCTGGAGAGGGCCCGGATGCAACGTTGCCATGGAAACCTCC 2646

QUERY: 2698 TGTGCTGATAACAAGGATAATGAGGGAGATGGCCTGGTGGATTGTTTGGACCCTGACTGC 2757

            || |||||||||||||||||||||||||||||||||||||| ||  ||||||||||||||

SBJCT: 2647 TGCGCTGATAACAAGGATAATGAGGGAGATGGCCTGGTGGACTGCCTGGACCCTGACTGC 2706

QUERY: 2758 TGCCTGCAGTCAGCCTGTCAGAACAGCCTGCTCTGCCGGGGGTCCCGGGACCCACTGGAC 2817

            ||||| ||||||||||||||||||||||||||||| |||||||| ||||||||  |||||

SBJCT: 2707 TGCCTCCAGTCAGCCTGTCAGAACAGCCTGCTCTGTCGGGGGTCTCGGGACCCCTTGGAC 2766

QUERY: 2818 ATCATTCAGCAGGGCCAGACGGATTGGCCCGCAGTGAAGTCCTTCTATGACCGTATCAAG 2877

            ||||||||||| |||||||| || ||||| || ||||||||||||||||| |||||||||

SBJCT: 2767 ATCATTCAGCAAGGCCAGACAGACTGGCCTGCGGTGAAGTCCTTCTATGATCGTATCAAG 2826

QUERY: 2878 CTCTTGGCAGGCAAGGATAGCACCCACATCATTCCTGGAGAGAACCCTTTCAACAGCAGC 2937

            ||||||||||||||||| ||||||||||||||||||||||| ||||| ||||| ||||||

SBJCT: 2827 CTCTTGGCAGGCAAGGACAGCACCCACATCATTCCTGGAGACAACCCCTTCAATAGCAGC 2886

QUERY: 2938 TTGGTTTCTCTCATCCGAGGCCAAGTAGTAACTACAGATGGAACTCCCCTGGTCGGTGTG 2997

             |||| ||||| |||||||||||||||||||| || ||||| || |||||||| ||||||

SBJCT: 2887 CTGGTGTCTCTGATCCGAGGCCAAGTAGTAACCACGGATGGGACCCCCCTGGTGGGTGTG 2946






QUERY: 2998 AACGTGTCTTTTGTCAAGTACCCAAAATACGGCTACACCATCACCCGCCAGGATGGCACG 3057 

I! HllliMIIIIIMIMIlMMM 1 1 1 i 1 1 1 1 1 ! ! M I MINIM MM!! 

SBJCT: 2947 AATGTGTCTTTTGTCAAGTACCCAAAATATGGCTACACCATCACTCGCCAGGACGGCACG 3006 

QUERY: 3058 TTCGACCTGATCGCAAATGGAGGTGCTTCCTTGACTCTACACTTTGAGCGAGCCCCGTTC 3117 

M MM1MI I! MIM M M MMMMM MMIMMMMMM Ml 

SBJCT: 3007 TTTGACCTGATTGCCAATGGGGGCTCTGCCTTGACTCTTCACTTTGAGCGAGCCCCTTTC 3066 
QUERY: 3118 ATGAGCCAGGAGCGCACTGTGTGGCTGCCGTGGAACAGCTTTTACGCCATGGACACCCTG 3177 

MIMM E I i 1 1 1 1 1 E II MM MIMMMMMM M IMMIIIMMMI 

SBJCT: 3067 ATGAGCCGGGAGCGCACAGTATGGCCGCCGTGGAACAGCTTCTATGCCATGGACACCCTG 3126 
QUERY: 3178 GTGATGAAGACCGAGGAGAACTCCATCCCCAGCTGTGACCTCAGTGGCTTTGTCCGGCCT 3237 

15 1 1 MUM II M 1 1 II II M I M M I M II II II M I M II 1 1 II M II I! I! I M M 

SBJCT: 3127 GTAATGAAGACGGAGGAGAACTCCATCCCCAGCTGTGACCTCAGTGGCTTTGTCCGGCCT 3186 
QUERY: 3238 GATCCAATCATCATCTCCTCCCCACTGTCCACCTTCTTTAGTGCTGCCCCTGGGCAGAAT 3297 

Mill M M II M M I M I M IMMMIIMMI M Ml MUM I Ml 

SBJCT: 3187 GATCCGATCATCATCTCCTCTCCTCTGTCCACCTTCTTCAGCGCTTCCCCTGCGGCGAAC 3246 
QUERY: 3298 CCCATCGTGCCTGAGACCCAGGTTCTTCATGAAGAAATCGAGCTCCCTGGTTCCAATGTG 3357 

Mill 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! I II MIMMMMIM MM Ml 

SBJCT: 3247 CCCATTGTGCCTGAGACCCAGGTTCTTCATGAGGAGATCGAGCTCCCTGGCACCAACGTG 3306 



QUERY: 3358 AAACTTCGCTATCTGAGCTCTAGAACTGCAGGGTACAAGTCACTGCTGAAGATCACCATG 3417 

M II M II II MMI Mill M I It 1 1 11 M 1 1 1 11 1 ! M i 1 1 M M I i I f 1 1 

SBJCT: 3307 AAGCTCCGTTACCTCAGCTCCAGAACAGCAGGGTACAAGTCACTGCTGAAGATCACCATG 3366 



QUERY: 3418 ACCCAGTCCACAGTGCCCCTGAACCTCATTAGGGTTCACCTGATGGTGGCTGTCGAGGGG 3477 

II I II 1 1 M II MUM MMMMM IMMMI MIMM II II MUM 

SBJCT: 3367 ACCCAGTCCACGGTGCCCTTGAACCTCATCCGGGTTCACTTGATGGTTGCCGTGGAGGGG 3426 
QUERY: 3478 CATCTCTTCCAGAAGTCATTCCAGGCTTCTCCCAACCTGGCCTCCACCTTCATCTGGGAC 3537 

35 Ml MIMMMMIM I II II II 1 1 M II II II 1 1 II 1 1 1 1 Ml II Ml MIMM 

SBJCT: 3427 CATCTCTTCCAGAAGTCGTTCCAGGCTTCTCCCAACCTGGCCTACACATTCATCTGGGAC 3486 
QUERY: 3538 AAGACAGATGCGTATGGCCAAAGGGTGTATGGACTCTCAGATGCTGTTGTGTCTGTCGGG 3597 

IMMMI M MIMMMMIM Mill II II MIMMMMMMM II 

SBJCT: 3487 AAGACAGACGCTTATGGCCAAAGGGTTTATGGCCTATCGGATGCTGTTGTGTCTGTTGGA 3546 
QUERY: 3598 TTTGAATATGAGACCTGTCCCAGTCTAATTCTCTGGGAGAAAAGGACAGCCCTCCTTCAG 3657 

M I II II I II M II I M IMMMI M II MMI MIMMMMIM MIM 

SBJCT: 3547 TTTGAATATGAGACCTGCCCCAGTCTCATCCTGTGGGAAAAAAGGACAGCCCTACTTCAA 3606 



QUERY: 3658 GGATTCGAGCTGGACCCCTCCAACCTCGGTGGCTGGTCCCTAGACAAACACCACATCCTC 3717 

MMIMMMMMM MIIIMI MIMMMIMM II M MIMM MM 

SBJCT: 3607 GGATTCGAGCTGGACCCTTCCAACCTTGGTGGCTGGTCCCTGGATAAGCACCACACCCTC 3666 



QUERY: 3718 AATGTTAAAAGTGGAATCCTACACAAAGGCACTGGGGAAAACCAGTTCCTGACCCAGCAG 3777 

MMI Mill Mill MM M M II II I MIM i I II I II I! M II I M M 1 1! 

SBJCT: 3667 AATGTGAAAAGCGGAATACTACTCAAAGGCACAGGGGAGAACCAGTTCCTGACCCAGCAG 3726 
QUERY: 3778 CCTGCCATC^TCACCAGCATC^TGGGC^ATGGTCGCCGCCGGAGCATTTCCTGTCCCAGC 3837 

55 ! | Ml MIMM MM Ml I Mill II I II III I II I Mill II IMMMI II 

SBJCT: 3727 CCCGCCATCATCACCAGCATCATGGGTAACGGTCGCCGCAGAAGCATCTCCTGTCCCAGC 3786 
QUERY: 3838 TGCAACGGCCTTGCTGAAGGCAACAAGCTGCTGGCCCCAGTGGCTCTGGCTGTTGGAATC 3897 

Mill I M II I II M II M II M M Ml MIMM Mill IMMMI II Ml 

SBJCT: 3787 TGCAATGGCCTTGCTGAAGGCAACAAACTGTTGGCCCCCGTGGCCCTGGCTGTGGGGATC 3846 
QUERY: 3898 GATGGGAGCCTCTATGTGGGTGACTTCAATTACATCCGACGCATCTTTCCCTCTCGAAAT 3957 

MIMMMMM Ml IMIIIIIIIIMI MIM IMMMI II MIIIMI 

SBJCT: 3847 GATGGGAGCCTCTTTGTCGGTGACTTCAATTATATCCGGCGCATCTTCCCTTCTCGAAAC 3906 



QUERY: 3958 GTGACCAGCATCTTGGAGTTACGAAATAAAGAGTTTAAACATAGCAACAACCCAGCACAC 4017 

MIIIMI IMIIIMMIIIIIIMMIMIIIIIMIMIIIIIII MMI MM 

SBJCT: 3907 GTGACCAGTATCTTGGAGTTACGAAATAAAGAGTTTAAACATAGCAACAGCCCAGGACAC 3966 



QUERY: 4018 AAGTACTACTTGGCAGTGGACCCCGTGTCCGGCTCGCTCTACGTGTCCGACACCAACAGC 4077 



MIIIIIIMMII I'll; Ml I 1 1 1 M II 1 1 1 1 II II IIIIMIIMI 

SBJCT: 3967 AAGTACTACTTGGCTGTGGACCCTGTGACTGGCTCGCTCTATGTCTCTGACACCAACAGT 4026 
QUERY: 4078 AGGAGAATCTACCGCGTCAAGTCTCTGAGTGGAACCAAAGACCTGGCTGGGAATTCGGAA 4137 

5 | | Ml I 1IIIMIMII II III IIIMIIIMIIMMIIIMMIM 

SBJCT: 4027 CGCCGGATCTACCGAGTCAAGTCTCTAAGCGGAGCCAAAGACCTGGCTGGGAATTCGGAA 4086 
QUERY: 4138 GTTGTGGCAGGGACGGGAGAGCAGTGTCTACCCTTTGATGAAGCCCGCTGCGGGGATGGA 4197 

I M M M I Mill 1 1 M M IIIIIIIIIIIIMIIMIIMIIM IMMIII 

SBJCT: 4087 GTTGTGGCCGGGACTGGCGAACAATGTCTACCCTTTGATGAAGCCCGCTGTGGGGATGGC 4146 
QUERY: 4198 GGGAAGGCCATAGATGCAACCCTGATGAGCCCGAGAGGTATTGCAGTAGACAAGAATGGG 4257 

MIMMI I II M I M II I M II II M I II I II I M 1 1 II II I M M II II III 

SBJCT: 4147 GGGAAGGCTGTGGATGCCACCCTGATGAGCCCTAGAGGTATTGCAGTAGACAAGAACGGG 4206 






QUERY: 4258 CTCATGTACTTTGTCGATGCCACCATGATCCGGAAGGTTGACCAGAATGGAATCATCTCC 4317 

M Mill Mill IMIMMMMMMIMMM Mill IMIMMMIMM 

SBJCT: 4207 CTTATGTATTTTGTTGATGCCACCATGATCCGGAAGGTCGACCAAAATGGAATCATCTCC 4266 



QUERY: 4318 ACCCTGCTGGGCTCCAATGACCTCACTGCCGTCCGGCCGCTGAGCTGTGATTCCAGCATG 4377 

M M M M 1 1 1 1 1 II 1 1 1 1 1 II II 1 1 II Mill 1 1 1 1 M M I M II II MUM 

SBJCT: 4267 ACCCTGCTGGGCTCCAATGACCTCACAGCTGTCCGACCACTGAGCTGTGACTCTAGCATG 4326 
QUERY: 4378 GATGTAGCCCAGGTTCGTCTGGAGTGGCCAACAGACCTTGCTGTCAATCCCATGGATAAC 4437 

25 M M IMMIII Mill II Mill Mill MM II Mill MIMMI M 

SBJCT: 4327 GACGTGGCCCAGGTCCGTCTAGAATGGCCGACAGACCTTGCGGTCAACCCCATGGACAAT 4386 
QUERY: 4438 TCCTTGTATGTTCTAGAGAACAATGTCATCCTTCGAATCACCGAGAACCACCAAGTCAGC 4497 

Ml MM II II MIMMI MIMMI II IIIIMIIMI MM! 1 1 II II 

SBJCT: 4387 TCCCTGTACGTCCTGGAGAACAACGTCATCCTGCGGATCACCGAGAATCACCAGGTCAGC 4446 
QUERY: 4498 ATCATTGCGGGACGCCCCATGCACTGCCAAGTTCCTGGCATTGACTACTCACTCAGCAAA 4557 

Mill IMMIII MMMIMIMM Mill Mill IMMIII MIMMI 

SBJCT: 4447 ATCATCGCGGGACGGCCCATGCACTGCCAGGTTCCCGGCATCGACTACTCGCTCAGCAAG 4506 



QUERY: 4558 CTAGCCATTCACTCTGCCCTGGAGTCAGCCAGTGCCATTGCCATTTCTCACACTGGGGTC 4617 

II Mill IMMIII Mill Mill Mill MMMIMIMM Mill 

SBJCT: 4507 CTCGCCATCCACTCTGCTCTGGAGTCAGCCAGCGCCATCGCCATTTCTCACACCGGGGTG 4566 



QUERY: 4618 CTCTACATCACTGAGACAGATGAGAAGAAGATTAACCGTCTACGCCAGGTAACAACCAAC 4677 

IMIIIIIIM Mill II IMIIMMM Mill IMIIIIIIM M MUM 

SBJCT: 4567 CTCTACATCACCGAGACGGACGAGAAGAAGATCAACCGCCTACGCCAGGTCACCACCAAC 4626 
QUERY: 4678 GGGGAGATCTGCCTTTTAGCTGGGGCAGCCTCGGACTGCGACTGCAAAAACGATGTCAAT 4737 

45 || IMIIIIIIM Mill IMIIIIIIM Mill I II MM MM II Mill 

SBJCT: 4627 GGAGAGATCTGCCTCTTAGCCGGGGCAGCCTCAGACTGTGACTGCAAAAATGACGTCAAC 4686 
QUERY: 4738 TGCAACTGCTATTCAGGAGATGATGCCTACGCGACTGATGCCATCTTGAATTCCCCATCA 4797 

MM MMIIMI IMMIII M Mill M IMMMMMMI Mill M 

SBJCT: 4687 TGCATCTGCTATTCGGGAGATGACGCATACGCCACGGATGCCATCTTGAACTCCCCGTCC 4746 
QUERY: 4798 TCCTTAGCTGTAGCTCCAGATGGTACCATTTACATTGCAGACCTTGGAAATATTCGGATC 4857 

IMIIIIIIM Mill Mill Mill Mill IMMIII II Mill MUM 

SBJCT: 4747 TCCTTAGCTGTGGCTCCGGATGGCACCATCTACATCGCAGACCTCGGGAATATCCGGATC 4806 



QUERY: 4858 AGGGCGGTCAGCAAGAACAAGCCTGTTCTTAATGCCTTCAACCAGTATGAGGCTGCATCC 4917 

IIIIIIIIIIMM Mill lllllllllll II MIIIIMIIIIIMIIMI II 

SBJCT: 4807 AGGGCGGTCAGCAAAAACAAACCTGTTCTTAACGCGTTCAACCAGTATGAGGCTGCGTCT 4866 



QUERY: 4918 CCCGGAGAGCAGGAGTTATATGTTTTCAACGCTGATGGCATCCACCAATACACTGTGAGC 4977 

II Mill Mill I II II IMMIII Mill Mill II Mill MUM 

SBJCT: 4867 CCGGGAGAACAGGAACTGTACGTGTTCAACGCCGATGGTATCCATCAGTACACCGTGAGC 4926 
QUERY: 4978 CTGGTGACAGGGGAGTACTTGTACAATTTCACATATAGTACTGACAATGATGTCACTGAA 5037 

65 IMMIII IMIIMMM MM I Mill I II II MM MM MM MM II 

SBJCT: 4927 CTGGTGACCGGGGAGTACTTATACAATTTCACCTACAGCGCTGACAATGATGTCACCGAG 4986 
QUERY: 5038 TTGATTGACAATAATGGGAATTCCCTGAAGATCCGTCGGGACAGCAGTGGCATGCCCCGT 5097 

IMIIMMM II MIMIIMM IMMIII 11 II II I II II M M II 1 1 1 1 M 

SBJCT: 4987 TTGATTGACAACAACGGGAATTCCCTAAAGATCCGCCGGGACAGCAGTGGCATGCCCCGA 5046 



QUERY: 5098 CACCTGCTCATGCCTGACAACCAGATCATCACCCTCACCGTGGGCACCAATGGAGGCCTC 5157 

lillMIIIIIIMM! II 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 MM MM MMIIMI 

SBJCT: 5047 CACCTGCTCATGCCTGATAATCAGATCATCACCCTTACGGTGGGCACCAACGGAGGCCTC 5106 


QUERY: 5158 AAAGTCGTGTCCACACAGAACCTGGAGCTTGGTCTCATGACCTATGATGGCAACACTGGG 5217 

INI MINI II IIIMMIIIIIII II IMMIII llllllll MM M 

SBJCT: 5107 AAAGCCGTGTCAACGCAGAACCTGGAGCTGGGCCTCATGACTTATGATGGGAACACTGGA 5166 


QUERY: 5218 CTCCTGGCCACCAAGAGCGATGAAACAGGATGGACGACTTTCTATGACTATGACCACGAA 5277 

Mill IIMIIIIIMMIIIIMI II 1 1 1 1 II Mill IMIIMIIIIIIIIII 

SBJCT: 5167 CTCCTAGCCACCAAGAGCGATGAAACCGGATGGACAACTTTTTATGACTATGACCACGAG 5226 
QUERY: 5278 GGCCGCCTGACCAACGTGACGCGCCCCACGGGGGTGGTAACCAGTCTGCACCGGGAAATG 5337 

IMII llllllll Mill L 1 1 1 1 1 1 1 1 1 J 1 1 i 1 1 1 Mill MIIMIMMIMI 

SBJCT: 5227 GGCCGTCTGACCAATGTGACTCGCCCCACGGGGGTGGTGACCAGCCTGCACCGGGAAATG 5286 
QUERY: 5338 GAGAAATCTATTACCATTGACATTGAGAACTCCAACCGTGATGATGACGTCACTGTCATC 5397 

20 MM MM M III llllllll II MM MM MM MM I II I MM Ml II 

SBJCT: 5287 GAGAAATCCATCACCGTTGACATTGAGAACTCCAACCGTGATAACGATGTCACTGTGATT 5346 
QUERY: 5398 ACCAACCTCTCTTC^GTAGAGGCCTCCTACACAGTGGTACAAGATCAAGTTCGGAACAGC 5457 

I M M 1 1 1 1 M 1 1 M II MMMIIIIIMI MMIMIIIIIIIMI IMMMM 

SBJCT: 5347 ACCAACCTCTCTTCAGTGGAGGCCTCCTACACCGTGGTACAAGATCAAGTGCGGAACAGC 5406 



QUERY: 5458 TACCAGCTCTGTAATAATGGTACCCTGAGGGTGATGTATGCTAATGGGATGGGTATCAGC 5517 

II 1 1 M II II I I II M MUM I II Mill II II II Mill Mill 

SBJCT: 5407 TACCAGCTCTGCAGCAACGGGACCCTGCGCGTCATGTACGCCAACGGCATGGGCGTCAGC 5466 






QUERY: 5518 TTCCACAGCGAGCCCCATGTCCTAGCGGGCACCATCACCCCCACCATTGGACGCTGCAAC 5577 

1 1 1 M II II II II II II Mill M MUM IMIMMMMI 1 1 Mill III 

SBJCT: 5467 TTCCACAGCGAGCCCCACGTCCTCGC^GGC^CCCTCACCCCCACCATCGGGCGCTGTAAC 5526 
QUERY: 5578 ATCTCCCTGCCTATGGAGAATGGCTTAAACTCCATTGAGTGGCGCCTAAGAAAGGAACAG 5637 

1 1 1 1 M 1 1 1 1 1 llllllll Ml I IMMIII I M II M II II M IMMMM 

SBJCT: 5527 ATCTCCCTGCCCATGGAGAACGGCCTGAACTCCATCGAGTGGCGCCTGAGGAAGGAACAG 5586 



QUERY: 5638 ATTAAAGGCAAAGTCACCATCTTTGGCAGGAAGCTCCGGGTCCATGGAAGAAATCTCTTG 5697 

1 1 1 1 II M II M M I M 1 1 1 1 1 II 1 1 llllllll II II M II Mill II III II 

SBJCT: 5587 ATTAAAGGCAAAGTCACCATCTTTGGGAGGAAGCTTCGGGTCCACGGAAGGAACCTCCTG 5646 
QUERY: 5698 TCCATTGACTATGATCGAAATATTCGGACTGAAAAGATCTATGATGACCACCGGAAGTTC 5757 

45 Miiiiii i mm ii mm ii mil imimmmi mmmmmiimi 

SBJCT: 5647 TCCATTGATTATGACCGAAATATCCGCACTGAGAAGATCTATGACGACCACCGGAAGTTC 5706 
QUERY: 5758 ACCCTGAGGATCATTTATGACCAGGTGGGCCGCCCCTTCCTCTGGCTGCCCAGCAGCGGG 5817 

IIIIIIIIIIIIIIMMMIIMIMIMIIIIIIMIII Mill MMMM II 

SBJCT: 5707 ACCCTGAGGATCATTTATGACCAGGTGGGCCGCCCCTTCCTGTGGCTCCCCAGCAGTGGA 5766 
QUERY: 5818 CTGGCAGCTGTCAACGTGTCATACTTCTTCAATGGGCGCCTGGCTGGGCTTCAGCGTGGG 5877 

Mill II Mill II II 1 1 1 1 1 1 1 1 1 1 1 IMIMMMI II M Mill Ml 

SBJCT: 5767 CTGGCGGCCGTCAATGTCTCCTACTTCTTCAACGGGCGCCTGGCCGGCCTCCAGCGCGGG 5826 



QUERY: 5878 GCCATGAGCGAGAGGACAGACATCGACAAGCAAGGCCGCATCGTGTCCCGCATGTTCGCT 5937 

IIIIIIIIIIIIIMMMIMI II 1 1 1 1 1 1 1 II Ml II mum 

SBJCT: 5827 GCCATGAGCGAGAGGACAGACATTGACAAGCAAGGCCGGATTGTGTCCCGAATGTTCGCC 5886 



QUERY: 5938 GACGGGAAAGTGTGGAGCTACTCCTACCTTGACAAGTCCATGGTCCTCCTGCTTCAGAGC 5997 

I II M 1 1 M M IMMIII II M II M 1 1 M I M III 1 1 M I Ml M I M I IMIM 

SBJCT: 5887 GACGGGAAAGTCTGGAGCTATTCCTACCTTGACAAGTCCATGGTCCTCCTGCTGCAGAGC 5946 
QUERY: 5998 CAACGTCAGTATATATTTGAGTATGACTCCTCTGACCGCCTCCTTGCCGTCACCATGCCC 6057 

65 II 1 1 1 1 1 1 1 1 1 1 1 1 1 II I I! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II II llllll MM 

SBJCT: 5947 CAGCGTCAGTACATATTTGAATATGACTCCTCTGACCGCCTCCACGCAGTCACCATGCCC 6006 
QUERY: 6058 AGCGTGGCCCGGCACAGCATGTCCACACACACCTCCATCGGCTACATCCGTAATATTTAC 6117 

II II IIIIMMMMIIIIMM MMIMMM IIMMIMM II MINI 

SBJCT: 6007 AGTGTCGCCCGGCACAGCATGTCCACGCACACCTCCATTGGCTACATCCGGAACATTTAC 6066 



QUERY: 6118 AACCCGCCTGAAAGCAATGCTTCGGTCATCTTTGACTACAGTGATGACGGCCGCATCCTG 6177 

MM I! M 1 1 1 1 1 1 II 1 1 f 1 1 1 1 r 1 1 1 1 1 1 i 1 i 1 1 1 1 1 1 ! 1 1 ( 1 1 1 1 E 1 1 1 1 1 1 1 

SBJCT: 6067 AACCCACCGGAAAGCAACGCCTCGGTCATCTTTGACTACAGTGATGACGGCCGCATCCTG 6126 

QUERY: 6178 AAGACCTCCTTTTTGGGCACCGGACGCCAGGTGTTCTACAAGTATGGGAAACTCTCCAAG 6237 

Mill 1 1 1 1 IIIMIIIII llllllllllllll Mill II MM MINI 

SBJCT: 6127 AAGACGTCTTTCCTGGGCACCGGGCGCCAGGTGTTCTATAAGTACGGAAAACTGTCCAAG 6186 
QUERY: 6238 TTATCAGAGATTGTCTACGACAGTACCGCCGTCACCTTCGGGTATGACGAGACCACTGGT 6297 

Mill Mill IIIIMIIIII II IMIIIMMIMI IMIIIIIIIMIMII 

SBJCT: 6187 TTATCGGAGATCGTCTACGACAGCACTGCCGTCACCTTCGGCTATGACGAGACCACTGGC 6246 
QUERY: 6298 GTCTTGAAGATGGTCAACCTGCAAAGTGGGGGCTTCTCCTGCACCATCAGGTACCGGAAG 6357 

15 I M 1 1 1 1 1 1 1 1 1 1 ! I 1 1 1 1 1 1 1 1 llllllllllllll llllllllllllll III 

SBJCT: 6247 GTCCTGAAGATGGTGAATCTCCAAAGCGGGGGCTTCTCCTGTACCATCAGGTACCGAAAG 6306 
QUERY: 6358 ATTGGCCCCCTGGTGGACAAGCAGATCTACAGGTTCTCCGAGGAAGGCATGGTCAATGCC 6417 

I II Mill II II I M 1 1 M 1 1 1 IN: IMMMMIM MM III 

SBJCT: 6307 GTCGGGCCCCTCGTGGACAAGCAGATTTACAGGTTCTCTGAGGAAGGCATGATCAACGCC 6366 
QUERY: 6418 AGGTTTGACTACACCTAT^TGACAAC^GCTTCCGCATCGC^GC^Ta^GCCCGTCATA 6477 

Mill II IMIMM M MMIIIMIIMIIMIM IMIIIIMMIIIIM 

SBJCT: 6367 AGGTTCGATTAC^CCTACCA.CGACAACAGCTTCCGCATCGCCAGCATCAAGCCCGTCATC 6426 
QUERY: 6478 AGTGAGACTCCCCTCCCCGTTGACCTCTACCGCTATGATGAGATTTCTGGCAAGGTGGAA 6537 

MMIIIMIMM IMMMIIMMMIMM IMMMMMMMMMIMM 

SBJCT: 6427 AGTGAGACTCCCCTTCCCGTTGACCTCTACCGCTACGATGAGATTTCTGGCAAGGTGGAA 6486 



QUERY: 6538 CACTTTGGTAAGTTTGGAGTC^TCTATTATGAC^TCAACCAGATCATCACCACTGCCGTG 6597 

MMI M Mill M llllllll II M II 1 1 M 1 1 II M II I II 1 1 II M M M 

SBJCT: 6487 C^CTTCGGC^GTTCGGGGT»TCTACTACGACATG^CCAGATCATC^CCACTGCCGTC 6546 
QUERY: 6598 ATGACCCTCAGCAAACACTTCGACACCCATGGGCGGATCAAGGAGGTCCAGTATGAGATG 6657 

35 Mill IMIMM MMI IMMMMIMM Ml MMI II MUM Ml III 

SBJCT: 6547 ATGACACTCAGCAAGCACTTTGACACCCATGGGCGCATCAAGGAAGTGCAGTATGAGATG 6606 
QUERY: 6658 TTCCGGTCCCTCATGTACTGGATGACGGTGCAATATGACAGCATGGGCAGGGTGATCAAG 6717 

MMIIIIIIIIIMMIIMIIMIIIMMIIIMMM Mill I MUM 

SBJCT: 6607 TTCCGGTCCCTCATGTACTGGATGACGGTGCAATATGACAGTATGGGCAGGGTCATCAAG 6666 
QUERY: 6718 AGGGAGCTAAAACTGGGGCCCTATGCCAATACCACGAAGTACACCTATGACTACGATGGG 6777 

Mill M IMMMIIMMMIMM Mill II I II I II 1 1 1 1 II II II M Ml 

SBJCT: 6667 AGGGAACTGAAACTGGGGCCCTATGCCAACACCACAAAGTACACCTATGACTACGACGGG 6726 



QUERY: 6778 GACGGGCAGCTCCAGAGCGTGGCCGTCAATGACCGCCCGACCTGGCGCTACAGCTATGAC 6837 

Mill iimmm miiimmiiiii n iiiiiin n imiimii 

SBJCT: 6727 GACGGCCAGCTCCAGAGTGTGGCCGTCAATGACCGGCCTACCTGGCGTTATAGCTATGAC 6786 



QUERY: 6838 CTTAATGGGAATCTCCACTTACTGAACCCAGGCAACAGTGTGCGCCTCATGCCCTTGCGC 6897 

II llllllll II III I II llllllll Nil IIIIMIIIII II III 

SBJCT: 6787 CTCAATGGGAACCTGCACCTGCTAAACCCAGGAAACAGTGCTCGCCTCATGCCGTTACGC 6846 
QUERY: 6898 TATGACCTCCGGGATCGGATAACCAGACTCGGGGATGTGCAGTACAAAATTGACGACGAT 6957 

55 III MM MM II II MM Mill II Mill MM Ml MM III II II III 

SBJCT: 6847 TATGACCTCCGTGACCGGATAACCAGGCTAGGGGACGTGCAGTACAAAATCGATGATGAT 6906
QUERY: 6958 GGCTATCTGTGCCAGAGAGGGTCTGACATCTTCGAATACAATTCCAAGGGCCTCCTAACA 7017 

MMM I IMIMIMM MMIIMIII IMIMM IMIMM 1 1 II I 

SBJCT: 6907 GGCTATTTATGCCAGAGAGGATCTGACATCTTTGAATACAACTCCAAGGGCCTTCTAACG 6966
QUERY: 7018 AGAGCCTACAACAAGGCCAGCGGGTGGAGTGTCCAGTACCGCTATGATGGCGTAGGACGG 7077 

Mill IMIIIIIIIIIIMIIIIMIIIII MIIMIIIIIIIIIIIMI I II 

SBJCT: 6967 AGAGCGTACAACAAGGCCAGCGGGTGGAGTGTGCAGTACCGCTATGATGGCGTGAGCCGC 7026 



QUERY: 7078 CGGGCTTCCTACAAGACCAACCTGGGCCACCACCTGCAGTACTTCTACTCTGACCTCCAC 7137 

IIIMIIMMIIIIIIIIIIIIIMIIMIIMI MMIIMIII II MIIIIMI 

SBJCT: 7027 CGGGCTTCCTACAAGACCAACCTGGGCCACCACCTACAGTACTTCTATTCCGACCTCCAC 7086 




QUERY: 7138 AACCCGACGCGCATCACCCATGTCTACAATCACTCCAACTCGGAGATTACCTCACTGTAC 7197

MM M M MMMMMI MMI MMMMMI MM! MMMM Ml 

SBJCT: 7087 CACCCCACACGTATCACCCATGTTTACAACCACTCCAACTCTGAGATCACCTCACTCTAC 7146 
QUERY: 7198 TACGACCTCCAGGGCCACCTCTTTGCCATGGAGAGCAGCAGTGGGGAGGAGTACTATGTT 7257

M MMMMMMMMMMMMMMMIMM MMMM MMMMMM 

SBJCT: 7147 TATGACCTCCAGGGCCACCTCTTTGCCATGGAGAGCAGTAGTGGGGAAGAGTACTATGTT 7206 
QUERY: 7258 GCCTCTGATAACACAGGGACTCCTCTGGCTGTGTTCAGCATCAACGGCCTCATGATCAAA 7317 

10 MMI MMMM IMIMMMMMIM M II MMI I II MM I Ml Ml 

SBJCT: 7207 GCCTCAGATAACACCGGGACTCCTCTGGCTGTTTTTAGTATCAATGGCCTCATGATCAAG 7266 
QUERY: 7318 CAGCTGCAGTACACGGCCTATGGGGAGATTTATTATGACTCCAACCCCGACTTCCAGATG 7377

II II M Mill MIMMMMMMM MMMMMI II MMI MUM 

SBJCT: 7267 CAACTCGAATACACAGCCTATGGGGAGATTTACTATGACTCCAATCC^GACTTTCAGATG 7326
QUERY: 7378 GTCATTGGCTTCCATGGGGGACTCTATGACCCCCTGACCAAGCTGGTCCACTTCACTCAG 7437 

Mill MMMM M II MMI MMMM llllllll II Mill II III 

SBJCT: 7327 GTCATCGGCTTCCACGGAGGCCTCTACGACCCCCTCACCAAGCTCGTTCACTTTACGCAG 7386 
QUERY: 7438 CGTGATTATGATGTGCTGGCAGGACGATGGACCTCCCCAGACTATACCATGTGGAAAAAC 7497

lllllllllll Illlllllllllll Mill MMI HIM I M 1 1 1 1 II I II 

SBJCT: 7387 CGTGATTATGACGTGCTGGCAGGACGGTGGACGTCCCCCGACTACACCATGTGGAGGAAT 7446



QUERY: 7498 GTGGGCAAGGAGCCGGCCCCCTTTAACCTGTATATGTTCAAGAGCAACAATCCTCTCAGC 7557

Illlllllllllll MMMM llllllll llllllllll MIMMII lllll 

SBJCT: 7447 GTGGGCAAGGAGCCAGCCCCCTTCAACCTGTACATGTTCAAGAACAAC 7506 
QUERY: 7558 AGTGAGCTAGATTTGAAGAACTACGTGACAGATGTGAAAAGCTGGCTTGTGATGTTTGGA 7617 

30 | | Mill Mill Mill I MMMMMI Mill IIIIIIM I Mill Ml III 

SBJCT: 7507 AATGAGCTGGATTTAAAGAACTACGTGACAGACGTGAAGAGCTGGCTCGTGATGTTTGGA 7566
QUERY: 7618 TTTCAGCTTAGCAACATCATTCCTGGCTTCCCGAGAGCCAAAATGTATTTCGTGCCTCCT 7677 

MMMM IIIIMIIIIMIIMI IMM I M II 1 1 1 II M 1 1 1 II MMMM 

SBJCT: 7567 TTTCAGCTCAGCAAC^TCATTCCTGGATTCCCAAGAGCCAAAATGTATTTTGTGCCTCCC 7626
QUERY: 7678 CCCTATGAATTGTCAGAGAGTCAAGCAAGTGAGAATGGACAGCTCATTACAGGTGTCCAA 7737

II I M II M MIIMIMI MMMMMMMMIMMMMMMMMIMM 

SBJCT: 7627 CCCTATGAACTGTCAGAGAGCCAAGCAAGTGAGAATGGACAGCTCATTACAGGTGTCCAG 7686 



QUERY: 7738 CAGACAACAGAGAGACATAACCAGGCCTTCATGGCTCTGGAAGGACAGGTCATTACTAAA 7797

MMMMMI! IMMMMMIM MMMI MMMMMM Mill 

SBJCT: 7687 CAGACAACAGAGAGGCATAACCAGGCCTTTCTGGCTCTAGAAGGACAGGTCATCTCTAAA 7746



QUERY: 7798 AAGCTCC^CGCCAGCATCCGAGAGAAAGCAGGTC^CTGGTTTGCCACCACC^CGCCCATC 7857

MMMM II IIIIMMIMMIIMM MMMMMI II MMMMMM 

SBJCT: 7747 AAGCTCCATGCAGGCATCCGAGAGAAAGCAGGCCACTGGTTTGCTACGACCACGCCCATC 7806 

QUERY: 7858 ATTGGCAAAGGCATCATGTTTGCCATCAAAGAAGGGCGGGTGACCACGGGCGTGTCCAGC 7917 

50 || MMMM MMMM II II I II 1 1 II II I M M 1 1 M I II I llllllll III 

SBJCT: 7807 ATCGGC^^GGC^TCA.TGTTCGCC^TCAAAGAAGGGCGGGTGACCACAGGCGTGTCTAGC 7866 

QUERY: 7918 ATCGCCAGCGAAGATAGCCGCAAGGTGGCATCTGTGCTGAACAACGCCTACTACCTGGAC 7977 

MMMM M II IIIIIIIMM IMM III IMIMMMMMIM Mill 

55 SBJCT: 7867 ATCGCCAGTGAGGACAGCCGCAAGGTAGCATCCGTGTTGAACAACGCCTACTACTTGGAC 7926 

QUERY: 7978 AAGATGCACTACAGCATCGAGGGCAAGGACACCCACTACTTTGTGAAGATTGGCTCAGCC 8037

MIMMMIIM IIMIMMIIMM Ml MMMM MMMM M MM 

SBJCT: 7927 AAGATGCACTACAGCATCGAGGGCAAGGACACACACTACTTCGTGAAGATCGGTGCAGCG 7986 



QUERY: 8038 GATGGCGACCTGGTCACACTAGGCACCACCATCGGCCGCAAGGTGCTAGAGAGCGGGGTG 8097 

II II llllllll II M M MMMM II MMMMMI MMMMMM 

SBJCT: 7987 GACGGTGACCTGGTTACGCTGGGGACCACCATTGGGCGCAAGGTGCTGGAGAGCGGGGTG 8046 



QUERY: 8098 AACGTGACCGTGTCCCAGCCCACGCTGCTGGTCAACGGCAGGACTCGAAGGTTCACGAAC 8157

MMMMIIMM IMMMIMMIMM MMMMIIMIMIMIMM Ml 

SBJCT: 8047 AACGTGACCGTGTCACAGCCCACGCTGCTGGTGAACGGCAGGACTCGAAGGTTCACCAAC 8106 
QUERY: 8158 ATTGAGTTCCAGTACTCCACGCTGCTGCTCAGCATCCGCTATGGCCTCACCCCCGACACC 8217 

70 mil II MM MUM Mil MMMI Mil II Mill II 1 1 1 1 II 1 1 M 1 1 1 II 




SBJCT: 8107 ATTGAATTCCAGTACTCCACGCTGCTGCTCAGCATACGCTACGGCCTCACCCCCGACACA 8166

QUERY: 8218 CTGGACGAAGAGAAGGCCCGCGTCCTGGACCAGGCGAGACAGAGGGCCCTGGGCACGGCC 8277 

Mill imilMIIMIMIIIIIIIIIII Ml 1 1 1 1 1 1 1 i i 1 1 1 1 1 i I I 1 1 1 

SBJCT: 8167 CTGGATGAAGAGAAGGCCCGCGTCCTGGACCAAGCGCGACAGAGGGCCGTGGGTACTGCC 8226 
QUERY: 8278 TGGGCCAAGGAGCAGCAGAAAGCCAGGGACGGGAGAGAGGGGAGCCGCCTGTGGACTGAG 8337 

IIMIIIMNIIIIIIIIIIIIIIIIIIIIIIIIIMIII Mill IIMIIII III 

SBJCT: 8227 TGGGCCAAGGAGCAGCAGAAAGCCAGGGACGGGAGAGAGGGCAGCCGTCTGTGGACGGAG 8286 
QUERY: 8338 GGCGAGAAGCAGCAGCTTCTGAGCACCGGGCGCGTGCAAGGGTACGAGGGATATTACGTG 8397 

MIMIIIMIIII II IIIIIMI II II MINIM M Mill IIIIIMM 

SBJCT: 8287 GGCGAGAAGCAGCAACTCCTGAGCACGGGACGGGTGCAAGGTTATGAGGGCTATTACGTG 8346



QUERY: 8398 CTTCCCGTGGAGCAATACCCAGAGCTTGCAGACAGTAGCAGCAACATCCAGTTTTTAAGA 8457

Mill 1111! II IIIIIMMM IIIIMIIIII1MIIMIMI1III IIMM 

SBJCT: 8347 CTTCCGGTGGAACAGTACCCAGAGCTGGCAGACAGTAGCAGCAACATCCAGTTCTTAAGA 8406 
QUERY: 8458 CAGAATGAGATGGGAAAGAGGTAACAAAATAATCTGCTGCCATTCCTTGTCTGAATGGCT 8517 

20 ! 1 1 M ! I i I M I i I ! I i 1 1 1 1 1 1 1 1 1 i I M 1 1 M 1 ! I f t II II! MM Mill 

SBJCT: 8407 C^GAATGAGATGGGAAAGAGGTAACAAAATAACCTGCTGCCACCTCTTCTCTGGGTGGCT 8466 
QUERY: 8518 CAGCAGGAGTAACTGTTATCTCCTCTCCTAAGGAGATGAAGACCTAACAGGGGCACTGCG 8577

IIIMIMl IIMM I 1 1 f 1 1 1 M 1 M 1 1 1 M I M 1 1 1 M M 1 1 II M M 1 1 M I 

SBJCT: 8467 CAGCAGGAGCAACTGTGACCTCCTCTCCTAAGGAGACGAAGACCTAACAGGGGCACTGAG 8526
QUERY: 8578 GCTGGGCTGCTTTAGGAGACCAAGTGGCAAGAAAGCTCACATTTTTTGAGTTCAAATGCT 8637

M I M II 1 1 1 1 1 1 II I IMMMMMIIMIIMMIIMMIIIMIMIIIMI 

SBJCT: 8527 GCCGGGCTGCTTTAGGACCCCAAGTGGCAAGAAAGCTCACATTTTTTGAGTTCAAATGCT 8586



QUERY: 8638 ACTGTCCAAGCGAGAAGTCCCTCATCCTGAAGTAGACTAAAGCCCGGCTGAAAATTCCGA 8697 

IMMIMMII 1 1 1 1 1 f I i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 111 MM MIMI M 

SBJCT: 8587 ACTGTCCAAGCGCAAAGTCCCTCATCCTGAAGTAGACTAGAGCTCGGCCACAAATTCTGA 8646 



QUERY: 8698 GGAAAACAAAAC 8709

IMMIMMII 

SBJCT: 8647 GGAAAACAAAAC 8658 

SCORE = 1459 BITS (736), EXPECT = 0.0
IDENTITIES = 1081/1196 (90%)
STRAND = PLUS / PLUS



QUERY : 


270 


SBJCT : 


123 


QUERY: 


330 


SBJCT : 


183 


QUERY : 


390 


SBJCT : 


243 


QUERY : 


450 


SBJCT : 


303 


QUERY : 


510 


SBJCT : 


363 


QUERY: 


570 


SBJCT : 


423 


QUERY: 


630 


SBJCT : 


483 



MM MINIMUM INN IIIMNI NINNMNIM Mill INN 

rCTGCAATAATGGATGTGAAGGATCGGCGACATCGCTCTTTGACCAGGGGACGGTGTGG 
\AAGAGTGTCGCTACACAAGCTCCTCTCTGGACAGTGAGGACTGCCGGGTGCCCACACA 

11 II 1 11 1 M 1 1 1 II I MMMMMMIMIMMMMIMM IIIIMM II 

^AGGAGTGTCGCTACACCAGCTCCTCTCTGGACAGTGAGGACTGCCGTGTGCCCACGCA 
^AATCCTACAGCTCCAGTGAGACTCTGAAGGCCTATGACCATGACAGCAGGATGCACTA 

II IIMIIII MM. Ml I IIIIMM IMIMMMIIMIII IIIIMM 

^AGTCCTACAGTTCCAGTGAGACCCTGAAGGCTTATGACCATGACAGCAGAATGCACTA 
3GAAACCGAGTCACAGACCTCATCCACCGGGAGTCAGATGAGTTTCCTAGACAAGGAAC 

IMIIMIMMIIMIIM I MIIMIMM IMIIIMI IIIMIMM I 

3GAAACCGAGTCACAGACCTGGTGCACCGGGAGTCCGATGAGTTTTCTAGACAAGGGGC 
AACTTCACCCTTGCCGAACTGGGCATCTGTGAGCCCTCCCCACACCGAAGCGGCTACTG 

60 " \\ HIM III II Ml I MM MM I M I M I M 1 1 1 i II I II 1 1 II Mill 

^ATTTCACCCTGGCAGAATTGGGAATCTGCGAGCCCTCCCCACACCGAAGTGGTTACTG 
rCCGACATGGGGATCCTTCACCAGGGCTACTCCCTTAGCACAGGGTCTGACGCCGACTC 

II II N N II II 1 1 II I II 1 1 1 1 II II II 1 1 II I Mill 1 1 E 1 1 1 1 1 II INN 



ININIIIIIINIIII IMIMIMMIMMI IIMIIINI 

iAGGGATGT CT C CAGAACATGC CAT CAGACTGTGGGGACGAGGGATAAA 542 




QUERY: 690 ATCCAGGCGCAGTTCCGGCCTGTCCAGTCGTGAAAACTCGGCCCTTACCCTGACTGACTC 749 

III IIIIIIII II III IIIIIII II II Mill IIIIIIII IIIIIIII II 

SBJCT: 543 ATCGAGGCGCAGCTCTGGCTTGTCCAGCCGCGAGAACTCAGCCCTTACTCTGACTGATTC 602 



QUERY: 750 TGACAACGAAAACAAATCAGATGATGAGAACGGTCGTCCCATTCC^ 809

Mill Mill Mill Mill II II Mill MMIMMIIMMIIMM 1

SBJCT: 603 TGACAATGAAAATAAATCGGATGACGACAATGGTCGACCCATTCCACCTACATCCTCGTC 662


QUERY: 810 TAGTCTCCTCCCATCTGCTCAGCTGCCTAGCTCCCATAATCCTCCACCAGTTAGCTGCCA 869

Ml M!Mllllll!llll!l!lllllll!ltlllllflll!illMI!1i!IMM!l

SBJCT: 663 TAGCCTCCTCCCATCTGCTCAGCTGCCTAGCTCCCATAATCCTCCACCAGTTAGCTGCCA 722




QUERY : 


870 


GATGCCATTGCTAGACAGCAACACCTCCCATCAAATCATGGACACCAACCC 


929 




QT3.TPT * 




MIIIMIMIIMIMMIIIMMMIMM IMMMMIMIIMI IIIIIIII 

GATGCCATTGCTAGACAGCAACACCTCCCATCAGATCATGGACA^^ 


782 




yUEiJXI . 


J J u 


ATTPTPPPPPAATTPATAPPTGCTCAGAGCATGCTCAGGGCCCCAGC^^ 

IMMMII IIIIMIIIMIIIIIIIIIIIIIMIIIIIIIIIIIMIIIMIM II 

ATTCTCCCCTAATTCATACCTGCTCAGAGCATGCTCAGGGCCCCAGCAAGCCTCCAGTAG 


989 


20 


SBJCT : 


783 


842 


QUERY: 990 TGGCCCTCCGAACCACCACAGCCAGTCGACTCTGAGGCCCCCTCTCCCACCCCCTCACAA 1049

IIIIIIIMIIIIIIIIIIIIIIIIII II MMMMIIMM Mill Mill II

SBJCT: 843 TGGCCCTCCGAACCACCACAGCCAGTCAACGCTGAGGCCCCCTCTGCCACCTCCTCATAA 902


25 


QUERY : 


1050 


CCACACGCTGTCCCATCACCACTCGTCCGCCAACTC^ 


1109 




QT1.TPT * 




MUM IMIIMI M II II M II IIIIIIIIIIIMIIII 1 i 1 1 1 1 1 1 1 1 i I f i 

C CACAC C C TGTCC CACCAC CACT CCTCTGCCAACT CCCT CAACAGAAACTCAC TGAC CAA 


962 


30 


HTTPDV • 


1 1 1 ft 


t raa raa a dT r a a a tp r apg p p p pggpc cpag cg c c CAATGAC CTGG C CAC CACAC CAGA 


1169 


QT3.TPT • 
orSU \~ 1 : 




IMMMIMM IIIMIIMM II II II IMIMMIIMMIIMM II M 

t raaraa a cxt p a a a t p p a pg p c p CAGCTCC TGCAC C CAAT GAC CTGGC CACCACGC cgga 


1022 






J. x / u 


GTC CGTT CAGCTT CAGGACAG CTGGGTGCTAAACAGCAACG TGCCACTGGAGAC CCGGC A 


1229 


35 




1023 


IIIIIMIMM MIMIIMMMMII Mill IMIIMI IMIIMI Mill 

GTCCGTTCAGCTCCAGGACAGCTGGGTGCTGAACAGTAACGTGCCGCTGGAGACGCGGCA 


1082 




QUERY : 


123 0 


CTTCCTCTTCAAGACCTCCTCGGGGAGCACACCCTTGTTCAGCAGCTCTTCCCCGGGATA 


1289 


40 


0.00 1 . 




IIMIMMIIMM Mill II lllllllll IMMIMIMIMM II Mill 

PTTP PT PTT PAAGACGTC CT CCGGAAG CACAC CC CTGTT CAG CAG CT CT T C T CCAGGATA 


1142 




1 Ton 


rrnTTTPAPPTPAGaAAPGOTTTAPJVCGCCCCCGCCCCGCCTGCTGCCCAGGAAT^ 


1349 








111 I1I1IIIIMI 1 1 Mill 1 1 II M IIIIMIIIMIM MMMI II 

CCCCTTGACCTCAGGGACCGTTTATACACCACCACCCCGCCTGCTGCCACGGAATACATT 


1202 


QUERY: 1350 CTCCAGGAAGGCTTTCAAGCTGAAGAAGCCCTCCAAATACTGCAGCTGGAAATGTGCTGC 1409

III IMIIMI IIIIMMMIMI IMMIIIIMMIMI MMMI! 1! 1 1

SBJCT: 1203 CTCTAGGAAGGCCTTCAAGCTGAAGAAACCCTCCAAATACTGCAGTTGGAAATGCGCCGC 1262


QUERY: 1410 CCTCTCCGCCATTGCCGCGGCCCTCCTCTTGGCTATTTTGCTGGCGTATTTCATAG 1465

111 II IMMMIIM 11 II 1 1 1 1 MM MIMMIMI IMIIMMI

SBJCT: 1263 CCTGTCTGCCATTGCCGCTGCCCTCCTTCTGGCCATTTTGCTGGCCTATTTCATAG 1318


SCORE = 1427 BITS (720), EXPECT = 0.0 
IDENTITIES = 996/1088 (91%)
STRAND = PLUS / PLUS 






QUERY: 1464 AGTGCCCTGGTCGTTGAAAAACAGCAGCATAGACAGTGGTGAAGCAGAAGTTGGTCGGCG 1523

MMMMMMMIIMIMMIIMMMIIMI II II IIIIIIII Mill II

SBJCT: 1440 AGTGCCCTGGTCGTTGAAAAACAGCAGCATAGACAGCGGCGAGGCAGAAGTCGGTCGACG 1499
QUERY: 1524 GGTAACACAAGAAGTCCCACCAGGGGTGTTTTGGAGGTCACAAATT^ 1583

III Mill MMMMMIMMMMMIMIMM 11 IMMIMMMMIM

SBJCT: 1500 GGTGACACAGGAAGTCCCACCAGGGGTGTTTTGGAGGTCCCAGATTCACATCAGTCAGCC 1559
QUERY: 1584 CCAGTTCTTAAAGTTCAACATCTCCCTCGGGAAGGACGCTCTCTTTGGTGTTTACATAAG 1643

IMIIIIIIIMIIIIIMIIIIIII IIIIMII M Mill II II IIIIIIII

SBJCT: 1560 TCAGTTCTTAAAGTTCAACATCTCCCTGGGGAAGGATGCCCTCTTCGGCGTCTACATAAG 1619



QUERY: 1644 AAGAGGACTTCCACCATCTCATGCCCAGTATGACTTCATGGAACGTCTGGACGGGAAGGA 1703 
SBJCT: 1620 AAGAGGACTGCCACC^TCTCATGCACAGTATGACTTCATGGAACGCCTGGACGGAAAGGA 1679

QUERY: 1704 GAAGTGGAGTGTGGTTGAGTCTCCCAGGGAACGCCGGAGCATACAGACCTTGGTTCAGAA 1763

llllllililll Ml Mill MIMIMMIIIIIIMII llllll 1 1 1 1 Mill 

SBJCT: 1680 GAAGTGGAGTGTGGTCGAGTCACCCAGGGAACGCCGGAGCATCCAGACCCTGGTGCAGAA 1739



QUERY: 1764 TGAAGCCGTGTTTGTGCAGTACCTGGATGTGGGCCTGTGGCATCTGGCCTTCTACAATGA 1823

II II Mill IIIIMIM IMMIMMMMMIM II I M 1 1 II M M 1 1 i 

SBJCT: 1740 CGAGGCTGTGTTCGTGCAGTACTTGGATGTGGGCCTGTGGCACCTCGCCTTCTACAATGA 1799

10 

QUERY: 1824 TGGAAAAGACAAAGAGATGGTTTCCTTCAATACTGTTGTCCTAGATTCAGTGCAGGACTG 1883 

II 1 1 MIM IMIIMI IMMMIMI MMM 1 1 1 M I f M 1 1 1 1 1 M M I 

SBJCT: 1800 CGGCAAGGACAAGGAGATGGTCTCCTTCAATACGGTTGTCTTAGATTCAGTGCAGGACTG 1859 
QUERY: 1884 TCCACGTAACTGCCATGGGAATGGTGAATGTGTGTCCGGGGTGTGTCACTGTTTCCCAGG 1943

IMMI IIMIMI Mill II Mill Mill M I! II 1 1 II t M M i 1 1 1 1 1 

SBJCT: 1860 TCCACGAAACTGCCACGGGAACGGCGAATGCGTGTCTGGACTGTGTCACTGTTTCCCAGG 1919 
QUERY: 1944 ATTTCTAGGAGCAGACTGTGCTAAAGCTGCCTGCCCTGTCCTGTGCAGTGGGAATGGACA 2003 

20 Ml II Ml Ml Mill IIMIMI MM IIMIMI I II 1 1 II M 1 1 II I II I II I 

SBJCT: 1920 ATTCCTAGGTGCAGACTGCGCTAAAGCTGCCTGCCCTGTTCTGTGCAGTGGGAATGGACA 1979 



QUERY: 2004 ATATTCTAAAGGGACGTGCCAGTGCTACAGCGGCTGGAAAGGTGCAGAGTGCGACGTGCC 2063 

Mill IIMIMI IMMMMMMI IMMMIMI MM! MIM I M II 

25 SBJCT: 198 0 GTATTCCAAAGGGACATGCCAGTGCTACAGTGGCTGGAAAGGAGCAGAATGCGATGTGCC 203 9 



QUERY- 2064 CATGAATCAGTGCATCGATCCTTCCTGCGGGGGCCACGGCTCCTGCATTGATGGGAACTG 2123 

MMM II I II 1 1 M 1 1 1 II M II 1 1 IIIIIIIMMIIIIMMIIIMIMMMI 

SBJCT: 2040 CATGAACCAGTGCATCGATCCTTCCTGTGGGGGCCACGGCTCCTGCATTGATGGGAACTG 2099 

30 

QUERY - 2124 TGTCTGCTCTGCTGGCTACAAAGGCGAGCACTGTGAGGAAGTTGATTGCTTGGATCCCAC 2183 

II II I MMMIMM IMMMIMI II MIM MMIMMIMM II 

SBJCT: 2100 CGTGTGTGCAGCTGGCTACAAGGGCGAGCACTGCGAAGAAGTGGATTGCTTGGATCCAAC 2159 
35 QUERY- 2184 CTGCTCCAGCCACGGAGTCTGTGTGAATGGAGAATGCCTGTGCAGCCCTGGCTGGGGTGG 2243 

IMIIIIIIIII II MMMIMM MIM II II IIMIMI IIMIMI II 

SBJCT: 2160 CTGCTCCAGCCATGGTGTCTGTGTGAACGGAGAGTGTCTATGCAGCCCCGGCTGGGGCGG 2219 



QUERY : 2244 TCTGAACTGTGAGCTGGCGAGGGTCCAGTGCCCAGACCAGTGCAGTGGGCATGGCACGTA 23 03 

40 || MIM IIMIIMIMIIIIIIMIMMIIMMI! I! Mill II Mill II 

SBJCT: 2220 GCTCAACTGCGAGCTGGCGAGGGTCCAGTGCCCAGACCAGTGTAGTGGGCATGGCACTTA 2279 
QUERY: 2304 CCTGCCTGACACGGGCCTCTGCAGCTGCGATCCCAACTGGATGGGTCCCGACTGCTCTGT 2363 

III llllll I IMIIMIII Ml Mill II I i M 1 1 1 1 1 1 i I i 1 1 1 i 1 1 1 E i I 

45 SBJCT: 2280 CCTCCCTGACTCTGGCCTCTGCAACTGTGATCCGAATTGGATGGGTCCCGACTGCTCTGT 233 9 



QUERY* 2364 TGAAGTGTGCTCAGTAGACTGTGGCACTCACGGCGTCTGCATCGGGGGAGCCTGCCGCTG 2423 

III Ml MMMIMM MMMIMM MMMIMM MM M i 1 1 1 1 II 1 J ! 1 1 1 E t 

SBJCT: 2340 TGAAGTGTGCTCAGTAGACTGTGGCACTCACGGCGTCTGCATCGGGGGAGCCTGCCGCTG 2399 

50 

QUERY: 2424 TGAAGAGGGCTGGACAGGCGCAGCGTGTGACCAGCGCGTGTGCCACCCCCGCTGCATTGA 24 83 

1 1 II I M 1 1 1 1 II 1 1 1 II I II II IIMMMIMMMMIMMMIMMIIMM 

SBJCT: 2400 TGAAGAGGGCTGGACAGGCGCGGCTTGTGACCAGCGCGTGTGCCACCCCCGCTGCATTGA 24 59 
55 QUERY: 2484 GCACGGGACCTGTAAAGATGGCAAATGTGAATGCCGAGAGGGCTGGAATGGTGAACACTG 2543 

I M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M I M 1 1 1 M 1 1 1 1 1 1 1 1 1 M II II I M II 1 1 1 1 1 M 1 1 II I 

SBJCT: 2460 GCACGGGACCTGTAAAGATGGCAAATGTGAATGCCGAGAGGGCTGGAATGGTGAACACTG 2519 



QUERY: 2544 CACCATTG 2551 



SBJCT: 2520 CACCATTG 2527 



In this search it was also found that the FCTR3b, c, d, and e nucleic acid had homology to
six fragments of Gallus gallus partial mRNA for teneurin-2. It has 2780 of 3449 bases (80%)
identical to bases 3386-6834, 1553 of 1862 bases (83%) identical to bases 1414-3275, 540 of
628 bases (85%) identical to bases 587-1214, 593 of 725 bases (81%) identical to bases 7084-7808,
429 of 515 bases (83%) identical to bases 7895-8409, and 397 of 475 bases (83%)
identical to bases 20-494 of Gallus gallus partial mRNA for teneurin-2 (EMBL Acc:
GGA279031) (Table 30).
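The figures quoted for the six teneurin-2 fragments can be cross-checked with a short calculation. The sketch below is illustrative only: the helper names `span_length` and `percent_identity` are hypothetical, it assumes each base range is inclusive and 1-based, and it assumes the reported percentages are truncated to a whole number rather than rounded, which is consistent with all six values given above.

```python
def span_length(start: int, end: int) -> int:
    """Number of bases in an inclusive, 1-based range such as 3386-6834."""
    return end - start + 1


def percent_identity(identical: int, aligned: int) -> int:
    """Whole-number percent identity, truncated (assumed, not rounded)."""
    return 100 * identical // aligned


# (identical bases, aligned bases, first subject base, last subject base),
# copied from the paragraph above.
fragments = [
    (2780, 3449, 3386, 6834),
    (1553, 1862, 1414, 3275),
    (540, 628, 587, 1214),
    (593, 725, 7084, 7808),
    (429, 515, 7895, 8409),
    (397, 475, 20, 494),
]

for identical, aligned, start, end in fragments:
    # Each subject range should span exactly the number of aligned bases.
    assert span_length(start, end) == aligned
    pct = percent_identity(identical, aligned)
    print(f"bases {start}-{end}: {identical}/{aligned} = {pct}%")
```

Under these assumptions every subject range spans exactly the stated number of aligned bases, and the truncated percentages reproduce the values quoted in the text.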

Table 30. BLASTN of FCTR3b, c, d, and e against Gallus gallus Teneurin-2 mRNA (SEQ ID NO:67)

>GI|10241573|EMB|AJ279031.1|GGA279031 GALLUS GALLUS PARTIAL MRNA FOR TENEURIN-2 (TEN2 GENE), LONG SPLICE VARIANT
LENGTH = 8409

SCORE = 1532 BITS (773), EXPECT = 0.0
IDENTITIES = 2780/3449 (80%)
STRAND = PLUS / PLUS






QUERY: 3458 TGATGGTGGCTGTCGAGGGGCATCTCTTCCAGAAGTCATTCCAGGCTTCTCCCAACCTGG 3517

MII1M 1 1 II I! MINIM II II II Mill I III IMIIIIM III 

SBJCT: 3386 TGATGGTAGCAGTAGAAGGGCATCTATTTCAAAAATCATTTCTGGCATCTCCCAACTTGG 3445 



QUERY: 3518 CCTCCACCTTCATCTGGGAGAAGACAGATGCGTATGGCCAAAGGGTGTATGGACTCTCAG 3577

I I II 1 1 1 1 1 1 1 1 1 1 1 M I MIIIMI Mill M I III Mill I III! 

SBJCT: 3446 CTTATACATTCATCTGGGACAAAACAGATGCATATGGTCAGAAGGTTTATGGGTTGTCAG 3505
QUERY: 3578 ATGCTGTTGTGTCTGTCGGGTTTGAATATGAGACCTGTCCCAGTCTAATTCTCTGGGAGA 3637

25 MMIM II Mill M 1 1 1 1 1 1 1 1 1 1 1 II 1 III MINI I Mill III MM 

SBJCT: 3506 ATGCTGTAGTTTCTGTGGGTTTTGAATATGAGACTTGTCCCAGTTTGATTCTGTGGGAGA 3565 
QUERY: 3638 AAAGGACAGCCCTCCTTCAGGGATTCGAGCTGGACCCCTCCAACCTCGGTGGCTGGTCCC 3697

MMIM II II II II Mill II M I II II Mill II II II Mill 

SBJCT: 3566 AAAGGACTGCGCTGCTGCAAGGATTTGAGCTAGATCCTTCCAATCTAGGAGGATGGTCTT 3625
QUERY: 3698 TAGACAAACACCACATCCTCAATGTTAAAAGTGGAATCCTACACAAAGGCACTGGGGAAA 3757

I II Mill II I M Mill II Mill II I IIIIIIIIM III MM 

SBJCT: 3626 TGGATAAACATCATGTACTGAATGTCAAGAGTGGTATATTGCACAAAGGCAATGGAGAAA 3685



QUERY: 3758 ACCAGTTCCTGACCCAGCAGCCTGCCATCATCACCAGCATCATGGGCAATGGTCGCCGCC 3817

I Mill II II MIIIMI M I II MIIIMI Mill Mill Mill 

SBJCT: 3686 ATCAGTTTCTAACTCAGCAGCCAGCTGTGATAACCAGCATTATGGGGAATGGGCGCCGAA 3745



QUERY: 3818 GGAGCATTTCCTGTCCCAGCTGCAACGGCCTTGCTGAAGGCAACAAGCTGCTGGCCCCAG 3877

I Mill IMIIMI IIIIMM II Mill Mill II Mill MMIM I 

SBJCT: 3746 GAAGCATATCCTGTCCTAGCTGCAATGGTCTTGCAGAAGGAAATAAGCTTTTGGCCCCTG 3805
QUERY: 3878 TGGCTCTGGCTGTTGGAATCGATGGGAGCCTCTATGTGGGTGACTTCAATTACATCCGAC 3937 

45 Ml Mill II Mill Mill MMIM Ml M II II MIIIMI M I 

SBJCT: 3806 TAGCACTGGCAGTGGGAATTGATGGAAGCCTCTTTGTTGGAGATTTTAATTACATTCGGC 3865
QUERY: 3938 GCATCTTTCCCTCTCGAAATGTGACCAGCATCTTGGAGTTACGAAATAAAGAGTTTAAAC 3997

I Mill 1 1 M I MIIIMI Mill MUM I I i I i I M i I i It I II II I 

SBJCT: 3866 GTATCTTCCCATCCAGGAATGTGACTAGCATATTGGAGCTGAGAAATAAAGAGTTTAAAC 3925
QUERY: 3998 ATAGC^CAACCCAGCACAC^GTACTACTTGGCAGTGGACCCCGTGTCCGGCTCGCTCT 4057

II II II II 1 1 II M Mill Mill MM MIIMIMM 1 1 Mill II I 

SBJCT: 3926 ATAGCAACAATCCTGCTCACAAATACTATCTGGCCGTGGACCCCGTTTCGGGCTCCCTGT 3985 



QUERY: 4058 ACGTGTCCGACACCAACAGCAGGAGAATCTACCGCGTCAAGTCTCTGAGTGGAACCAAAG 4117

ill! ii iiiiiimiii i i ii in inn inn i mi ii mi 

SBJCT: 3986 ACGTATCAGACACCT^CAGCCGACGGATATACAAAGTCAAATCTCTTACTGGCACGAAAG 4045



QUERY: 4118 ACCTGGCTGGGAATTCGGAAGTTGTGGCAGGGACGGGAGAGCAGTGTCTACCCTTTGATG 4177

IMMIIIM Mill Mill II II Mill IMMMI II M IIIMIIMI 

SBJCT: 4046 ACCTGGCTGGTAATTCTGAAGTGGTAGCGGGGACTGGAGAGCAATGCCTGCCCTTTGATG 4105 



QUERY: 4178 AAGCCCGCTGCGGGGATGGAGGGAAGGCCATAGATGCAACCCTGATGAGCCCGAGAGGTA 4237

IMM I II 11 1IIIIIMIM I! I II IMIIIM Mlil II till I 

SBJCT: 4106 AAGCCAGATGTGGAGATGGAGGGAAAGCAGTGGACGCAACCCTAATGAGTCCTCGAGGAA 4165 
QUERY: 4238 TTGCAGTAGACAAGAATGGGCTCATGTACTTTGTCGATGCCACCATGATCCGGAAGGTTG 4297

1 1 1 1 II I II III 1 1 1 1 llllllll IMM IMIIIM IMM 1 1 II II I 

SBJCT: 4166 TTGCAGTGGATAAGTATGGACTCATGTATTTTGTTGATGCCACTATGATTCGAAAAGTGG 4225 
QUERY: 4298 ACCAGAATGGAATCATCTCCACCCTGCTGGGCTCCAATGACCTCACTGCCGTCCGGCCGC 4357 

10 | M M II II II I M II II III MM MM MM IMM MM I III I II II I 

SBJCT: 4226 ATCAGAATGGAATTATATCAACTCTGCTGGGCTCCAATGACCTAACTGCCGTCCGACCTC 4285 
QUERY: 4358 TGAGCTGTGATTCCAGCATGGATGTAGCCCAGGTTCGTCTGGAGTGGCCAACAGACCTTG 4417 

I MMMMMMMMMIMM MMM M IMIIIIMM II M II I 

15 SBJCT: 42 86 TAAGCTGTGATTCCAGCATGGATGTCAGCCAGGTACGGCTGGAGTGGCCTACTGATCTCG 4345 
QUERY: 4418 CTGTCAATCCCATGGATAACTCCTTGTATGTTCTAGAGAACAATGTCATCCTTCGAATCA 4477 

Mill IMIMIMI Mill I Mill II II 1 1 II I M II I II I M MM 

SBJCT: 4346 CTGTCGATCCCATGGACAACTCACTTTATGTCCTAGAGAACAATGTTATTTTACGGATCA 4405 






QUERY * 44 78 CCGAGAACCACC^GTC^GC^TCATTGCGGGACGCCCCATGCACTGCCAAGTTCCTGGCA 453 7 

I M Mill Mill Mill Mill I i M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i I IMIIIM I 

SBJCT: 44 06 CAGAAAACCATCAAGTTAGCATTATTGCTGGACGCCCCATGCACTGCCAGGTTCCTGGTA 4465 



25 QUERY: 4 53 8 TTGACTACT»CTCAGCAAACTAGCCATTCACTCTGCCCTGGAGTCAGCCAGTGCCATTG 4597 

I IMIIIM II I IN II Mill li II M M II II II I II II M M I 

SBJCT: 4466 TAGACTACTCTCTTAGCAAACTGGCTATTCATTCCGCACTTGAATCAGCCAGTGCCATTG 4525 
QUERY* 4598 CCATTTCTCACACTGGGGTCCTCTACATCACTGAGACAGATGAGAAGAAGATTAACCGTC 4657 

30 | Ml II Mill M II II MM Ml Mill MM Ml II II Mill III 

SBJCT: 4526 CCATCTCACACACAGGAGTTCTTTACATCAGTGAGACAGATGAAAAAAAAATTAATCGGC 4585 
QUERY: 4 658 TACGCCAGGTAACAACCAACGGGGAGATCTGCCTTTTAGCTGGGGCAGCCTCGGACTGCG 4717 

I II M I II 1 1 II I Mill M II II MMM I II MIMIII M Mill I 

35 SBJCT: 4 586 TACG CCAGGTAACTAC CAATGGAGAAATATGC CT TCTTG CAGGGGCAGCTT CAGACTGTG 4645 
QUERY: 4718 ACTGCAAAAACGATGTCAATTGCAACTGCTATTCAGGAGATGATGCCTACGCGACTGATG 4777 

! M 1 1 M II IMIIIM M II IMIIIM II IMMM M II IMIMI 

SBJCT: 4646 ATTGCAAAAATGATGTCAACTGTAATTGCTATTCTGGGGATGATGGGTATGCCACTGATG 4705 



QUERY: 4778 CCATCTTGAATTCCCCATCATCCTTAGCTGTAGCTCCAGATGGTACCATTTACATTGCAG 4837 

IMIMI IMM Mill II M II M M I II IMMMMIMM Mill II I 

SBJCT: 4706 CCATCTTAAATTCACCATCTTCCTTAGCTGTGGCCCCAGATGGTACCATCTACATAGCTG 4765 



45 QUERY: 4 83 8 ACCTTGGAAATATTCGGATCAGGGCGGTCAGCAAGAACAAGCCTGTTCTTAATGCCTTCA 4 897 

I II llllllll II II Mill Mill II MM Ml II I M M I I II I 

SBJCT: 4766 ATCTCGGAAATATCCGCATTAGGGCTGTCAGTAAAAACAGGCCCATTCTTAATTCTTTTA 4 825 
QUERY : 4 898 ACCAGTATGAGGCTGCATC C CC CGGAGAGCAGGAGTTATATGTT T T CAACG CTGATGG CA 4957 

50 MM lllll MIMIII II I! I IMIIIM I Mill Mill I II IMM I 

SBJCT: 4 826 AC CAATATGAAGC TGCAT CT C CAGGAGAACAGGAGCTGT ATGTCTT CAATG CTGATGGGA 4885 
QUERY- 4958 T C CAC CAATACACTGTGAG C CTGGTGACAGGGGAGTACTTGTACAATTT CACATATAGTA 5017 

I Mill MMM I Mill M M MMMMMMMMMIMM lllll I 

55 SBJCT: 48 86 TTCACCAGTACACTCTCAGCCTTGTTACCGGGGAGTACTTGTACAATTTCACCTATAGCA 4 94 5 
QUERY: 5018 CTGACAATGATGTCACTGAATTGATTGACAATAATGGGAATTCCCTGAAGATCCGTCGGG 5077 

Ml M IMIIIM M MM MM Mill II III Mill IMIIMM 

SBJCT: 4 946 GTGATAACGATGTCACCGAGGTGATGGACAGCAATGGCAACTCCTTGAAGGTCCGTCGGG 5005 



QUERY: 5078 ACAGCAGTGGCATGCCCCGTCACCTGCTCATGCCTGACAACCAGATCATCACCCTCACCG 5137 

I Ml II MIMIII II I II MIMIII M lllll MM M Ml 

SBJCT : 5006 ATGCCAGCGGAATGCCCCGCCATTTACTGATGCCTGATAATCAGATTGTCACGCTGGCCG 5065 



65 QUERY: 5138 TGGGCACCAATGGAGGCCTCAAAGTCGTGTCCACACAGAACCTGGAGCTTGGTCTCATGA 5197 

I Mill Mill M MMM I II M II MM MMM Mill I MM 

SBJCT: 5066 TTGGCACTAATGGTGGACTCAAACTAGTCTCAACGCAGACCCTGGAACTTGGATTAATGA 5125 
QUERY: 5198 CCTATGATGGCAACACTGGGCTCCTGGCCACCAAGAGCGATGAAACAGGATGGACGACTT 5257 

70 | Ml | M MM III III I II II lllll II III I llllll Mill M I 
SBJCT: 5126 CTTATAACGGAAACAGTGGTCTCTTAGCAACGAAGAGTGATGAAACAGGATGGACAACAT 5185
QUERY * 5258 TCTATGACTATGACCACGAAGGCCGCCTGACCAACGTGACGCGCCCCACGGGGGTGGTAA 5317 

I MIIMMII! II Mill lllllllllll II II M 1 1 1 1 1 II MMM! 

SBJCT: 5186 TTTATGACTATGATCATGAAGGGCGCCTGACCAATGTAACACGTCCCACTGGAGTGGTAA 5245 
QUERY * 5318 CCAGT CTGCAC CGGGAAATGGAGAAAT CTAT TAC CATTGACATTGAGAACT C CAAC CGTG 5377 

! II II II II MMIMI II Mlllllllll IMMUIIII II II II I 

SBGCT: 524 6 CTAGCCTTCATCGAGAAATGGAAAAGTCTATTACCATCGACATTGAGAATTCTAATCGGG 53 05 
QUERY * 5378 ATGATGACGTCACTGTCATCACCAACCTCTCTTCAGTAGAGGCCTCCTACACAGTGGTAC 5437 

IlIMM Mill MMIIM II Mill II II Mill Mill Mill II I 

SBJCT: 53 06 ATGATGATGTCACGGTCATCACAAATCTCTCCTCTGTGGAGGCTTCCTATACAGTTGTTC 53 65 



15 QUERY: 543 8 AAGATCAAGTTCGGAACAGCTACCAGCTCTGTAATAATGGTACCCTGAGGGTGATGTATG 5497 

MIIMMII f ! 1 1 1 1 1 1 1 1 1 1 1 f 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 MM MIIMMII 

SBJCT: 5366 AAGATCAAGTGAGGAACAGCTACCAGCTCTGTAATAATGGTACTTTGAGAGTGATGTATG 5425 
QUERY * 54 98 CTAATGGGATGGGTATCAGCTTCCACAGCGAGCCCCATGTCCTAGCGGGCACCATCACCC 5557 

20 MMM III MM Mill MMIIM II MMIMI II M M MM 

SBJCT: 5426 CCAATGGCATGAGTATTAGCTTTCACAGCGAACCTCATGTCCTGGCTGGGACAGTAACTC 5485 
QUERY- 5558 CCACCATTGGACGCTGCAACATCTCCCTGCCTATGGAGAATGGCTTAAACTCCATTGAGT 5617 

1 1 M 1 1 1 Mill II II II II II II M M I II 1 1 II II MM Mill I 

25 SBJCT: 5486 CCACCATAGGACGATGTAATATTTCTCTACCAATGGAGAATGGTTTGAACTCAATTGAAT 5545 
QUERY* 5618 GGCGCCTAAGAAAGGAACAGATTAAAGGCAAAGTCACCATCTTTGGCAGGAAGCTCCGGG 5677 

MM II M II IMIIIIMIIIIIIMIII M I MM II MM III 

SBJCT: 5546 GGCGTCTGAGGAAAGAACAGATTAAAGGCAAAGTGACTGTGTTTGGAAGAAAGCTCAGGG 5605 



QUERY- 567 8 TCCATGGAAGAAATCTCTTGTCCATTGACTATGATCGAAATATTCGGACTGAAAAGATCT 5737 

I MMIMI III I MIIMMII II II II MM M II MM MM 

SBJCT: 5606 TTCATGGAAGGAATTTGCTGTCCATTGATTACGACCGGAATATACGCACAGAAAAAATCT 5665 



35 QUERY* 573 8 ATGATGACCACCGGAAGTTCACCCTGAGGATCATTTATGACCAGGTGGGCCGCCCCTTCC 5797 

I MM MM IMIIMMIIIIMM MM II Ml Ml M MMM 

SBJCT: 5666 ACGATGATCACCGCAAGTTCACCCTGAGGATAATTTACGATCAGCTGGGACGGCCCTTCC 5725 
QUERY - 5798 TCTGGCTGCCCAGCAGCGGGCTGGCAGCTGTC^ACGTGTCATACTTCTTCAATGGGCGCC 5857 

40 Ml MMIMI III MM MM M MM Ml I II MMIMI MMM 

SBJCT: 5726 TCTGGCTGCCCAGCAGCGGCCTGGCTGCCGTCAACGTGTCCTATTTCTTCAACGGGCGCC 5785 
QUERY* 5858 TGGCTGGGCTTCAGCGTGGGGCCATGAGCGAGAGGACAGACATCGACAAGCAAGGCCGCA 5917 

II 1 1 1 II I II II 1 1 1 1 II MMIIM MIIIIMIIIIMIIMIIIIII I I 

45 SBJCT: 5786 TGGCTGGGCTTCAGCGCGGAGCCATGAGCGAAAGGACAGACATCGACAAGCAAGGCAGGA 5845 
QUERY- 5918 TCGTGTCCCGCATGTTCGCTGACGGGAAAGTGTGGAGCTACTCCTACCTTGACAAGTCCA 5977 

II I M MMIIM M M MM II MM III MMM II M Ml 

SBJCT : 5846 TCATATCGCGCATGTTTGCAGATGGGAAGGTTTGGAGTTACACCTACCTAGAAAAATCCA 5905 



QUERY * 5978 TGGTCCTCCTGCTTCAGAGCCAACGTCAGTATATATTTGAGTATGACTCCTCTGACCGCC 6037 

Mil II IMMIMMIMI II Mill II IMMUIIII M II Mill I 

SBJCT: 5906 TGGTAC TACTG CTTCAGAGCCAG CGGCAGTACATCT T TGAGTATGAT T CTTCAGAC CGGC 5 965 



55 QUERY: 6038 TCCTTGCCGTCACCATGCCCAGCGTGGCCCGGCACAGCATGTCCACACACACCTCCATCG 6097 

Ml Ml II M MM M M M MM IlIMM II MM M I I 

SBJCT: 5966 TCCATGCTGTTACTATGCCTAGTGTTGCTCGGCATAGCATGTCAACTCACACGTCTGTTG 6025 
QUERY: 6098 G CTACATC CGTAATAT TTACAACC CG CCTGAAAGCAATGCT T CGGT CAT CT TTGACTACA 6157 

60 MMM I MMIIM II II MMM I MM II II II II MM MM 

SBJCT: 6026 GCTACATTAGGAATATTTATAATCCTCCTGAAAGCAACGCATCAGTGATTTTTGATTACA 6085 
QUERY: 6158 GTGATGACGGCCGCATCCTGAAGACCTCCTTTTTGGGCACCGGACGCCAGGTGTTCTACA 6217 

MMM 1 1 I II Ml II II MM M M II II II M II Ml 

65 SBJCT: 6086 GTGATGATGGGAGGATT TTGAAAACAT CATT TTTAGGTAC TGGT CGACAAGT CTTTTACA 6145 
QUERY: 6218 AGTATGGGAAACTCT C CAAGT TAT CAGAGATTGT CTACGACAGTAC CGCCGT CAC CTT CG 6277 

MMM II M MM MM M MM II MMIIM II II II M I 

SBJCT: 6146 AGTATGGAAAGCTATCCAAATTATCTGAAATTGTTTATGACAGTACTGCGGTTACTTTTG 6205 
QUERY: 6278 GGTATGACGAGACCACTGGTGTCTTGAAGATGGTCAACCTCCAAAGTGGGGGCTTCTCCT 6337

I MM! 1 1 1 1 1 1 M M 1 1 I 1 1 MM! M I MMMM M M M I 

SBJCT: 6206 GATATGATGAAACTACAGGTGTCCTAAAAATGGTGAATTTGCAAAGTGGAGGATTTTCTT 6265 
5 QUERY: 633 8 GCACCATCAGGTACCGGAAGATTGGCCCCCTGGTGGACAAGCAGATCTACAGGTTCTCCG 63 97 

I M Ml ! M M 1 1 MMMM M M MM! 1 1 MMMM MM! I 

SBJCT : 6266 GTACMTCCGCTATCGTAAAATTGGCCCTCTTGTTGACMACAAATCTACAGATTCTCTG 6325 
QUERY: 6398 AGGAAGGCATGGTCAATGCCAGGTTTGACTACACCTATCATGACAACAGCTTCCGCATCG 6457 

10 | MM! I Ml MM ill Ml MM! II II II MMMM M M IMM I 

SBJCT: 6326 AAGAAGGTATGGTCAATGCAAGGTTTGATTATACATATCACGACAATAGTTTTCGCATTG 63 85 
QUERY: 6458 CAAGCATCAAGCCCGTCATAAGTGAGACTCCCCTCCCCGTTGACCTCTACCGCTATGATG 6517 

IMIIIMM Ml MMMMMMMM II II II M I II Mill lllllll 

15 SBJCT: 63 86 CAAGCATCAAACCCATCATAAGTGAGACTCCTCTTCCAGTTGATCTTTACCGTTATGATG 6445 
QUERY: 6518 AGATTTCTGGCAAGGTGGAACACTTTGGTAAGTTTGGAGTCATCTATTATGACATCAACC 6577 

MIMMMMM II II II IMM II MMMM II llllllll II II I 

SBJCT: 6446 AGATTTCTGGCAAAGTTGAGCATTTTGGCAAATTTGGAGTTATTTATTATGATATAAATC 6S05 






QUERY: 6578 AGATCATCACCACTGCCGTGATGACCCTCAGCAAACACTTCGACACCCATGGGCGGATCA 6637 

I II II II II II 1 1 1 1 1 1 1 II II II IMM M Mill II M M I 

SBJCT: 6506 AAATTATTACTACAGCAGTTATGACACTGAGTAAGCACTTTGATACCCACGGACGCATTA 6565 



25 QUERY: 663 8 AGGAGGTCCAGTATGAGATGTTCCGGTCCCTCATGTACTGGATGACGGTGCAATATGACA 6697 

I II II II M II 1 1 1 II M II I Mill IIIMIIIIIMM MIMMMMM 

SBJCT: 6566 AAGAAGTTCAATATGAGATGTTCCGATCCCTGATGTACTGGATGACTGTGCAATATGACA 6625 
QUERY : 6698 GCATGGGCAGGGTGATCAAGAGGGAGCTAAAACTGGGGCCCTATGCCAATACCACGAAGT 6757 

30 II Mill II II I II II II II Mill Mill MMMM II II MM 

SBJCT: 6626 GCATGGGAAGAGTAACTAAAAGAGAACTGAAACTTGGGCCGTATGCCAACACAACCAAGT 6685 
QUERY: 6758 ACACCTATGACTACGATGGGGACGGGCAGCTCCAGAGCGTGGCCGTCAATGACCGCCCGA 6817 

I MMMM M IMM M Mill I II IMM II II Mill I II I 

35 SBJCT: 6686 ATACCTATGATTATGATGGAGATGGGCAATTGCAAAGCGTAGCAGTAAATGATAGGCCTA 6745 
QUERY: 6818 CCTGGCGCTACAGCTATGACCTTAATGGGAATCTCCACTTACTGAACCCAGGCAACAGTG 6877 

MIMM Mill MMMM Mill IMM Mi I Mill II II lllllll 

SBJCT: 6746 CCTGGCGTTACAGTTATGACCTGAATGGAAATCTTCACCTCCTGAATCCTGGAAACAGTG 6805 



QUERY: 6878 TGCGCCTCATGCCCTTGCGCTATGACCTC 6906 

i ii i linn iiiim mm 

SBJCT: 6806 TTCGATTGATGCCCCTGCGCTACGACCTC 6834 



SCORE = 1241 BITS (626), EXPECT = 0.0
IDENTITIES = 1553/1862 (83%) 
STRAND = PLUS / PLUS 



QUERY: 1486 AGCAGCATAGACAGTGGTGAAGCAGAAGTTGGTCGGCGGGTAACACAAGAAGTCCCACCA 1545

miiimii mil ill iiMiiiiii ii ill ii urn ii ii ii

SBJCT: 1414 AGCAGCATAGATAGTGGAGAAACAGAAGTTGGCCGCAAGGTCACCCAAGAGGTGCCCCCT 1473
QUERY: 1546 GGGGTGTTTTGGAGGTCACAAATTCACATCAGTCAGCCCCAGTTCTTAAAGTTCAACATC 1605

M Mill III MM II II II Mill Mill llllll 1 lllllllllll

SBJCT: 1474 GGAGTGTTCTGGCGGTCTCAGATCCATATCAGCCAGCCACAGTTCCTGAAGTTCAACATA 1533
QUERY: 1606 TCCCTCGGGAAGGACGCTCTCTTTGGTGTTTACATAAGAAGAGGACTTCCACCATCTCAT 1665

Mill MMMM Mill II MMMM IMIMMMMM llllllll Ml

SBJCT: 1534 TCCCTAGGGAAGGATGCTCTTTTCGGTGTTTATATAAGAAGAGGACTCCCACCATCACAT 1593
QUERY: 1666 GCCCAGTATGACTTCATGGAACGTCTGGACGGGAAGGAGAAGTGGAGTGTGGTTGAGTCT 1725

II MMMM lllllllllll MM Mill Mill IMMMIMI II II

SBJCT: 1594 GCACAGTATGATTTCATGGAACGCTTGGATGGGAAAGAGAAATGGAGTGTGGTGGAATCC 1653
QUERY: 1726 CCCAGGGAACGCCGGAGCATACAGACCTTGGTTCAGAATGAAGCCGTGTTTGTGCAGTAC 1785

II lllllll II II II Mill 1 MIIMIMM II MMMM llllll

SBJCT: 1654 CCACGGGAACGGCGAAGTATTCAGACTCTTGTTCAGAATGAGGCTGTGTTTGTTCAGTAC 1713
QUERY: 1786 CTGGATGTGGGCCTGTGGCATCTGGCCTTCTACAATGATGGAAAAGACAAAGAGATGGTT 1845

1 1 1 f 1 1 1 1 1 1 lllllll Mill M MIIMIMM I

SBJCT: 1714 TTGGATGTGGGTTTGTGGCACCTGGCGTTTTACAATGATGGCAAGGACAAAGAAGTGGTC 1773



QUERY: 1846 TCCTTCAATACTGTTGTCCTAGATTCAGTGCAGGACTGTCCACGTAACTGCCATGGGAAT 1905

MINN II! Ill I I IIMMIMM 1 1 1 1 1 1 1 II 1 1 1 1 1 II Mill III 

SBJCT: 1774 TCCTTCAGTACAGTTATTTTGGATTCAGTGCAAGACTGTCCACGTAATTGTCATGGCAAT 1833 
QUERY * 1906 GGTGAATGTGTGTCCGGGGTGTGTCACTGTTTCCCAGGATTTCTAGGAGCAGACTGTGCT 1965 

II 1 1 Mill M II M II Mllllll II M II 11 1 II II M M M II 1 1 

SBJCT: 1834 GGCGAGTGTGTTTCTGGTGTCTGCCACTGTTTTCCCGGATTTCATGGAGCAGATTGTGCT 1893 
QUERY * 1966 AAAGCTGCCTGCCCTGTCCTGTGCAGTGGGAATGGACAATATTCTAAAGGGACGTGCCAG 2025 

MIMMIIIMM II IMIMMIM Mill II M II Mill M Ml I 

SBJCT: 1894 AAAGCTGCCTGCCCGGTGCTGTGCAGTGGCAATGGTCAGTACTCCAAAGGAACCTGCTTG 1953 



15 QUERY- 2026 TGCTACAGCGGCTGGAAAGGTGCAGAGTGCGACGTGCCCATGAATCAGTGCATCGATCCT 20 85 

IIMllll IMMIMMM I M II M II Mill I Mill II Mill 

SBJCT: 1954 TGCTACAGTGGCTGGAAAGGTCCGGAATGTGATGTACCCATCAGCCAGTGTATTGATCCC 2013 
QUERY : 2086 TCCTGCGGGGGCCACGGCTCCTGCATTGATGGGAACTGTGTCTGCTCTGCTGGCTACAAA 214 5 

20 M M || M II M IIMllll II MM MM Mill I II MUM III 

SBJCT: 2 014 TCGTGTGGAGGTCATGGTTCCTGCATCGAAGGGAACTGTGTCTGTTCCATTGGCTATAAA 2073 
QUERY * 2146 GGCGAGCACTGTGAGGAAGTTGATTGCTTGGATCCCACCTGCTCCAGCCACGGAGTCTGT 2205 

II M M M M M I II II M I M 1 1 M Mill M MIMM Mill IIMM 

25 SBJCT: 2074 GGAGAAAACTGTGAGGAAGTTGATTGCTTAGATCCAACATGCTCCAATCACGGGGTCTGT 2133 
QUERY- 2206 GTGAATGGAGAATGCCTGTGCAGCCCTGGCTGGGGTGGTCTGAACTGTGAGCTGGCGAGG 2265 

Mill II M M II II IIMllll IIMMIMM I MMIIIMM I II 

SBJCT: 2134 GTGAACGGAGAATGTCTCTGCAGCCCAGGCTGGGGTGGAATAAACTGTGAGCTTCCCAGA 2193 



QUERY: 2266 GTCCAGTGCCCAGACCAGTGCAGTGGGCATGGCACGTACCTGCCTGACACGGGCCTCTGC 2325 

I I M II II II II II II I II II II M II I II M M MIMI MIMM II Mill 

SBJCT: 2194 GCCCAGTGCCCAGACCAGTGCAGTGGGCATGGCACATACCTGTCTGACACCGGTCTCTGT 2253 



QUERY: 2326 AGCTGCGATCCCAACTGGATGGGTCCCGACTGCTCTGTTGAAGTGTGCTCAGTAGACTGT 2385

II II II II II II M I II II II II II II II M 1 1 M MMMIMIMM MIMMII 

SBJCT: 2254 AGCTGCGATCCCAACTGGATGGGTCCCGACTGCTCCGTTGAAGTGTGCTCTGTAGACTGT 2313
QUERY: 2386 GGCACTCACGGCGTCTGCATCGGGGGAGCCTGCCGCTGTGAAGAGGGCTGGACAGGCGCA 2445

40 mil || || M Mill II Mill II IIMMIMM II II IIMM I 

SBJCT: 2314 GGCACCCATGGGGTGTGCATTGGCGGAGCGTGTCGCTGTGAAGAAGGGTGGACAGGAGTG 2373 
QUERY: 2446 GCGTGTGACCAGCGCGTGTGCCACCCCCGCTGCATTGAGCACGGGACCTGTAAAGATGGC 2505 

MIMMIIIMM Mill II Mill II I IIIIMM M MMIIIMM 

SBJCT: 2374 GCGTGTGACCAGCGTGTGTGTCATCCCCGGTGTACAGAGCACGGAACTTGTAAAGATGGG 2433
QUERY: 2506 AAATGTGAATGCCGAGAGGGCTGGAATGGTGAACACTGCACCATTGGTAGGCAAACGGCA 2565 

MIMMMMI II M M M II II II M II I II I II II II II II II I II 1 1 1 II II 

SBJCT: 2434 AAATGTGAATGCAGAGAGGGCTGGAATGGGGAGCACTGCACCATTGGTAGGCAAACGACA 2493 



QUERY: 2566 GGCACCGAAACAGATGGCTGCCCTGACTTGTGCAACGGTAACGGGAGATGCACACTGGGT 2625 

1 H 1 1 H 1 1 M E 1 1 1 N I M 1 1 1 1 1 M 1 1 1 M 1 1 1 II IMMMI Mill Mill 

SBJCT: 2494 GGCACCGAAACAGATGGCTGCCCTGACTTGTGCAATGGCAACGGGAGGTGCACGCTGGGC 2553



QUERY: 2626 CAGAACAGCTGGCAGTGTGTCTGCCAGACCGGCTGGAGAGGGCCCGGATGCAACGTTGCC 2685

II II II M M M II I II M M M I II M M M II II II II II II MMMIIIIMMI 

SBJCT: 2554 CAGAACAGCTGGCAGTGTGTCTGCCAGACCGGCTGGAGAGGGCCTGGATGCAACGTTGCC 2613 
QUERY : 2686 ATGGAAACTTCCTGTGCTGATAACAAGGATAATGAGGGAGATGGCCTGGTGGATTGTTTG 2745 

60 IIIIMM II Ml III III MM MM III IMMIMMM MM II II I 

SBJCT: 2614 ATGGAAACCTCCTGTGCCGATAACAAGGATAACGAGGGAGATGGCTTGGTTGACTGCCTA 2673
QUERY: 2746 GACCCTGACTGCTGCCTGCAGTCAGCCTGTCAGAACAGCCTGCTCTGCCGGGGGTCCCGG 2805

I Ml II IIIIMM Mill I Mill IIMMIMM IIIIMM Mill 

SBJCT: 2674 GTCCCAGATTGCTGCCTCCAGTCCACTTGTCAAAACAGCCTGCTGTGCCGGGGTTCCCGC 2733
QUERY: 2806 GACCCACTGGACATCATTCAGCAGGGCCAGACGGATTGGCCCGCAGTGAAGTCCTTCTAT 2865

II II M IIIIMM II Ml MM I I II II II Mllllll IIMM 

SBJCT: 2734 GATCCTCTTGACATCATACAACAGAGCCATTCTGGTTCACCAGCTGTGAAGTCATTCTAT 2793 
QUERY: 2866 GACCGTATCAAGCTCTTGGCAGGCAAGGATAGCACCCACATCATTCCTGGAGAGAACCCT 2925

II || IIIIIIIIIII I II Mill Mill II IIIIIIII Mill II 1 1 

SBJCT: 2794 GATCGAATCAAGCTCTTAGTGGGGAAGGACAGCACTCATATCATTCCAGGAGAAAATCCC 2853 
QUERY: 2926 TTCAACAGCAGCTTGGTTTCTCTCATCCGAGGCCAAGTAGTAACTACAGATGGAACTCCC 2985

! M I M I II M I I II Mill II IIIMMIII 1 1 IMMIIIMMII II 

SBJCT: 2854 TTCAACAGCAGCCTTGTGTCTCTTATAAGAGGCCAAGTGGTGACTACAGATGGAACGCCT 2913
QUERY: 2986 CTGGTCGGTGTGAACGTGTCTTTTGTGAAGTACCCAAAATACGGCTACACCATCACCCGC 3045

10 * MM II M IIIIIIII III MM MM Mill II I III MUM III II 

SBJCT: 2914 CTAGTTGGGGTCAACGTGTCATTTGTCAAGTATCCAAAGTATGGCTATACCATCACTCGT 2973 
QUERY * 3046 CAGGATGGCACGTTCGACCTGATCGCAAATGGAGGTGCTTCCTTGACTCTACACTTTGAG 3105 

MIMIMM III III II I M II II M I III I III i IMIIMI 

SBJCT: 2974 CAGGATGGCATGTTTGACTTGGTTGCTAACGGTGGATCATCCCTAACTTTGCACTTTGAA 3033
QUERY- 3106 CGAGCCCCGTTCATGAGCCAGGAGCGCACTGTGTGGCTGCCGTGGAACAGCTTTTACGCC 3165 

M Mill II Mill Mill I M II MIIIIMIIMIIIIIMI II Ml 

SBJCT: 3034 CGGGCCCCATTTATGAGTCAGGAAAGGACAGTATGGCTGCCGTGGAACAGCTTCTATGCC 3093 

20 

QUERY • 3166 ATGGACACCCTGGTGATGAAGACCGAGGAGAACTCCATCCCCAGCTGTGACCTCAGTGGC 3225 

MIMIII M M Mill M MMMMMMM MIMIMMI IMIIMM 

SBJCT: 3094 ATGGACACGCTTGTAATGAAAACAGAGGAGAACTCCATTCCCAGCTGTGATCTCAGTGGC 3153
QUERY* 3226 TTTGTCCGGCCTGATCCAATCATCATCTCCTCCCCACTGTCCACCTTCTTTAGTGCTGCC 3285 

mm i iiiMMM MMMi ii it ii ii i m i ii urn mi m 

SBJCT: 3154 TTTGTCAGACCTGATCCAGTCATCATTTCATCACCACTGTCAACTTTCTTCAGTGATGCT 3213 
QUERY: 3286 CCTGGGCAGAATCCCATCGTGCCTGAGACCCAGGTTCTTCATGAAGAAATCGAGCTCCCT 3345

mil i urn m ii ii ii iimMMiiiMiiiiiMii m mil 

SBJCT: 3214 CCTGGCCGAAATCCTATTGTACCAGAAACCCAGGTTCTTCATGAAGAAATTGAGGTCCCT 3273






QUERY: 3346 GG 3347

            ||

SBJCT: 3274 GG 3275



SCORE = 547 BITS (276), EXPECT = E-152
IDENTITIES = 540/628 (85%)
STRAND = PLUS / PLUS



QUERY * 782 GTCGTCCCATTCCACCTACATCCTCGCCTAGTCTCCTCCCATCTGCTCAGCTGCCTAGCT 841 

IIIIIIIIIIMIIIilllllMIII MM II MIIIMimilllMIM M I 

SBJCT: 587 GTCGTCCCATTCCACCTACATCCTCGTCTAGCCTTCTCCCATCTGCTCAGCTGCCCAGTT 646
QUERY* 842 CCCATAATCCTCCACCAGTTAGCTGCCAGATGCCATTGCTAGACAGCAACACCTCCCATC 901 

I IIIIIIIMIIIIIIIIIMMMIIIIIIIIIIIMIIMMMM II MMMI 

SBJCT: 647 CTCATAATCCTCCACCAGTTAGCTGCCAGATGCCATTGCTAGACAGCAATACGTCCCATC 706 



QUERY: 902 AAATCATGGACACCAACCCTGATGAGGAATTCTCCCCCAATTCATACCTGCTCAGAGCAT 961

MIIIIMIIIIMII Mill Mill Mill II IMMIMMI II MMMI 

SBJCT: 707 AAATCATGGACACCAATCCTGACGAGGAGTTCTCTCCTAATTCATACCTACTAAGAGCAT 766
QUERY: 962 GCTCAGGGCCCCAGCAAGCCTCCAGCAGTGGCCCTCCGAACCACCACAGCCAGTCGACTC 1021 

55 | IIIIIIII Mill I! I MMMI MM II I I 1 1 1 1 1 I MM II ill I II I 

SBJCT: 767 GTTCAGGGCCACAGCAGGCATCCAGCAGTGGCCCTTCAAACCATCACAGCCAGTCAACGC 826
QUERY * 1022 TGAGGCCCCCTCTCCCACCCCCTCACAACCACACGCTGTCCCATCACCACTCGTCCGCCA 1081 

1 1 1 1 Ml IIIIIIII II IMIIMMIM MMMMIIIII IIIIIIII MM 

SBJCT: 827 TGAGGCCACCTCTCCCCCCTCCTCACAACCACTCGCTGTCCCATCATCACTCGTCTGCCA 886
QUERY: 1082 ACTCCCTCAACAGGAACTCACTGACCAATCGGCGGAGTCAGATCCACGCCCCGGCCCCAG 1141 

IIIIIIIMIIMMIIII II Mill II II I IMMIMMI M II II I 

SBJCT: 887 ACTCCCTCAACAGGAACTCGCTCACCAACCGCCGCAACCAGATCCACGCGCCTGCTCCCG 946



QUERY: 1142 CGCCCAATGACCTGGCCACCACACCAGAGTCCGTTCAGCTTCAGGACAGCTGGGTGCTAA 1201 

I MMMMMMM Mill II Mill II Mill IIMIIIMIIIIIIM I 

SBJCT: 947 CTCCCAATGACCTGGCGACCACGCCTGAGTCTGTGCAGCTGCAGGACAGCTGGGTGCTCA 1006 



QUERY: 1202 ACAGCAACGTGCCACTGGAGACCCGGCACTTCCTCTTCAAGACCTCCTCGGGGAGCACAC 1261

            ||||||||||||| ||||||||| |||| ||| | || ||||| || || || |  || |

SBJCT: 1007 ACAGCAACGTGCCGCTGGAGACCAGGCATTTCTTGTTTAAGACATCTTCTGGAACGACTC 1066
QUERY: 1262 CCTTGTTCAGCAGCTCTTCCCCGGGATACCCTTTGACCTCAGGAACGGTTTACACGCCCC 1321

5 * | MIIMI IMIIMMM M Mill IIMIIIIIIMI Mill I! II I 

SBJCT: 1067 CGCTGTTCAGTAGCTCTTCCCCTGGCTACCCACTGACCTCAGGAACAGTTTATACTCCAC 1126 
QUERY: 1322 CGCCCCGCCTGCTGCCCAGGAATACTTTCTCCAGGAAGGCTTTCAAGCTGAAGAAGCCCT 1381

I Ml I Ml I M II Mill II llllllll M MIMIIMM IMIIII 

SBJCT: 1127 CTCCCAGGCTGTTACCTAGAAATACATTTTCCAGGAATGCATTCAAGCTGAAAAAGCCCT 1186






QUERY: 1382 CCAAATACTGCAGCTGGAAATGTGCTGC 1409

            |||| || || |||||||||||||||||

SBJCT: 1187 CCAAGTATTGTAGCTGGAAATGTGCTGC 1214

SCORE = 391 BITS (197) , EXPECT = E-105 
IDENTITIES = 593/725 (81%) 
STRAND = PLUS / PLUS 



20 QUERY * 7156 CATGTCTACAATCACTCCAACTCGGAGATTACCTCACTGTACTACGACCTCCAGGGCCAC 7215 

II II M M II 1 1 1 1 Mill II II MIIIMI Mill M M II M MUM 

SBJCT: 7084 CATGTCTACAATCATTCCAATTCAGAAATTACCTCTCTGTATTATGATCTGCAAGGCCAC 7143 
QUERY* 7216 CTCTTTGCCATGGAGAGCAGCAGTGGGGAGGAGTACTATGTTGCCTCTGATAACACAGGG 7275 

25 MIIIMI MIIMI! MIMIIMM II II Mill II! II MM II II I! 

SBJCT: 7144 CTCTTTGCAATGGAGAGTAGCAGTGGGGAAGAATATTATGTCGCCTCCGATAACACGGGC 7203 
QUERY * 7276 ACTCCTCTGGCTGTGTTCAGCATCAACGGCCTCATGATCAAACAGCTGCAGTACACGGCC 7335 

Mill II II II Nil IMIIIIIIIIIIIIIMM IIMIMI II 

30 SBJCT: 7204 ACTCCGCTAGCCGTATTCAGCATCAATGGCCTCATGATCAAACAGCTTCAGTACACTGCA 7263 
QUERY • 7336 TATGGGGAGATTTATTATGACTCCAACCCCGACTTCCAGATGGTCATTGGCTTCCATGGG 73 95 

II M MIIMMMMIMM Mill M MMM MM Mill IIMIMI 

SBJCT: 7264 TACGGAGAGATTTATTATGACTCAAACCCTGATTTCCAGCTGGTTATTGGGTTCCATGGA 7323 



QUERY: 7396 GGACTCTATGACCCCCTGACCAAGCTGGTCCACTTCACTCAGCGTGATTATGATGTGCTG 7455 

II II Mill II I Mill M Mill I! II M I II II Mill II 

SBJCT: 7324 GGGCTGTATGATCCTTTAACCAAACTCGTCCATTTTACCCAAAGGGACTACGATGTCCTT 7383 



40 QUERY: 7456 GCAGGACGATGGACCTCCCCAGACTATACCATGTGGAAAAACGTGGGCAAGGAGCCGGCC 7515 

II Mill Mill II II II M II MMIIIIIIM I II I M M I! 

SBJCT: 7384 GCTGGACGCTGGACATCTCCTGATTACACAATGTGGAAAAACATTGGTAGAGAACCTGCT 7443
QUERY* 7516 CCCTTTAACCTGTATATGTTCAAGAGCAACAATCCTCTCAGCAGTGAGCTAGATTTGAAG 7575 

45 |! Ml || Mill II MM Mill Mill MMM MM Ml II III I Ml 

SBJCT: 7444 CCCTTCAATCTGTACATGTTCAAGAGTAACAACCCTCTCAGCAATGAACTGGATCTAAAG 7503 
QUERY: 7576 AACTACGTGACAGATGTGAAAAGCTGGCTTGTGATGTTTGGATTTCAGCTTAGCAACATC 7635

I! II II IIMIMI MIMIIMM IIMIMI I M M M M M 1 1 II M II I 

50 SBJCT: 7504 AATTATGTAACAGATGTCAAAAGCTGGCTGGTGATGTTCGGATTTCAGCTTAGCAACATT 7563 
QUERY: 7636 ATTCCTGGCTTCCCGAGAGCCAAAATGTATTTCGTGCCTCCTCCCTATGAATTGTCAGAG 7695 

II II 1 1 M M II II Mill I II II I M II Ml I Mill M I! II I III 

SBJCT: 7564 ATTCCTGGCTTCCCTAGAGCAAAAATGTACTTTGTGTCACCTCCATACGAGCTGACTGAG 7623 



QUERY: 7696 AGTCAAGCAAGTGAGAATGGACAGCTCATTACAGGTGTCCAACAGACAACAGAGAGACAT 7755 

IIMIMI I II I IIMMIMM IIMIMI Mill MIMIIMM Mill 

SBJCT: 7624 AGTCAAGCGTGTGAAAATGGACAGCTAATTACAGGAGTCCAGCAGACAACAGAAAGACAC 7683 



60 QUERY: 7756 AACCAGGCCTTCATGGCTCTGGAAGGACAGGTCATTACTAAAAAGCTCCACGCCAGCATC 7815 

M II M IMIIMMM II IMIIMMM MMM I II Mill 

SBJCT: 7684 AATCAAGCTTTCATGGCTCTTGAGGGACAGGTCATATCTAAAAGATTACATGCCAGTATT 7743 
QUERY: 7816 CGAGAGAAAGCAGGTC^CTGGTTTGCCACCACCACGCCCATCATTGGCAAAGGCATCATG 7875 

65 MM llllllll IMIIMMM II I III II II Mill Mill MMM 

SBJCT: 7744 AGAGAAAAAGC^GGCCACTGGTTTGCAACAAGCACTCCTATTATTGGGAAAGGAATCATG 7803 

QUERY: 7876 TTTGC 7880

            |||||

SBJCT: 7804 TTTGC 7808




SCORE = 339 BITS (171), EXPECT = 2E-89 
IDENTITIES = 429/515 (83%) 
STRAND = PLUS / PLUS 



QUERY: 7967 ACTACCTGGACAAGATGCACTACAGCATCGAGGGCAAGGACACCCACTACTTTGTGAAGA 8026 

IIMIIMM II IIIIIIIIIIIIIIIIIIII Mill M M 1 1 1 1 M I M MM 

SBJCT: 7895 ACTACCTGGAAAAAATGCACTACAGCATCGAGGGGAAGGATACTCACTACTTTGTCAAGA 7954
QUERY: 8027 TTGGCTCAGCCGATGGCGACCTGGTCACACTAGGCACCACCATCGGCCGCAAGGTGCTAG 8086

I MMMIIMM IIIIMI Mill M I I MM Ml I Mill II I 

SBJCT: 7955 TAGGCTCAGCCGATAGCGACCTCGTCACCCTCGCGATGACCAGCGGGAGGAAGGTCCTGG 8014 
QUERY: 8087 AGAGCGGGGTGAACGTGACCGTGTCCCAGCCCACGCTGCTGGTCAACGGCAGGACTCGAA 8146

15 | Mill II IMIMMIM IMIIMI II II II IIIIMI IMIIMM 

SBJCT: 8015 ACAGCGGAGTAAACGTGACCGTCTCCCAGCCAACCCTCCTTATCAACGGAAGGACTCGAC 8074
QUERY * 8147 GGTTCACGAACATTGAGTTCCAGTACTCCACGCTGCTGCTCAGCATCCGCTATGGCCTCA 8206 

IIIMM Mill Mill Mill Mill MIMI Ml MIIIIMI II MM 

20 SBJCT: 8075 GGTTGACAAACATCGAGTTTCAGTATTCCACCCTGCTGATGAACATCCGCTACGGGCTCA 8134 
QUERY: 8207 CCCCCGACACCCTGGACGAAGAGAAGGCCCGCGTCCTGGACCAGGCGAGACAGAGGGCCC 8266 

II MIMM Mill M MMMM II M II IMIIMI I Ml I MM 

SBJCT: 8135 CCGCCGACACGCTGGATGAGGAGAAGGCACGAGTGCTAGACCAGGCTCGGCAGCGAGCCC 8194 






QUERY* 8267 TGGGCACGGCCTGGGCCAAGGAGCAGCAGAAAGCCAGGGACGGGAGAGAGGGGAGCCGCC 8326 

MM MIMIMMMI IMMIIMM M MM II I Mill MIMI 

SBJCT: 8195 TGGGGTCGGCCTGGGCCAAAGAGCAGCAGAAGGCACGGGATGGCCGCGAGGGCAGCCGCG 8254 



30 QUERY: 8327 TGTGGACTGAGGGCGAGAAGCAGCAGCTTCTGAGCACCGGGCGCGTGCAAGGGTACGAGG 8386 

I Mill M II MMMM IIMIIMM Ml II I II Mill IIIIMI 

SBJCT: 8255 TATGGACAGACGGAGAGAAGCAACAGCTTCTGAACACGGGAAGGGTTCAAGGTTACGAGG 8314 
QUERY: 8387 GATATTACGTGCTTCCCGTGGAGCAATACCCAGAGCTTGCAGACAGTAGCAGCAACATCC 8446

            ||||||| ||  | || |||||||| ||||||||||| ||||||||||||||||||||||

SBJCT: 8315 GATATTATGTCTTGCCTGTGGAGCAGTACCCAGAGCTAGCAGACAGTAGCAGCAACATCC 8374

QUERY: 8447 AGTTTTTAAGACAGAATGAGATGGGAAAGAGGTAA 8481

            |||||||||||||||||| ||||||||||||||||

SBJCT: 8375 AGTTTTTAAGACAGAATGAAATGGGAAAGAGGTAA 8409

SCORE = 323 BITS (163), EXPECT = 1E-84 
IDENTITIES = 397/475 (83%) 
STRAND = PLUS / PLUS 

QUERY: 299  GACACCGCTCTTTGACCAGAGGACGCTGTGGCAAAGAGTGTCGCTACACAAGCTCCTCTC 358

            |||||||||||||||| ||||| || || || || ||||||||||| || || || || |

SBJCT: 20   GACACCGCTCTTTGACGAGAGGCCGGTGCGGGAAGGAGTGTCGCTATACTAGTTCTTCAC 79
QUERY: 359  TGGACAGTGAGGACTGCCGGGTGCCCACACAGAAATCCTACAGCTCCAGTGAGACTCTGA 418

            | |||||||| |||||| | || ||  | ||||| |||||||||||||||||||| ||||

SBJCT: 80   TCGACAGTGAAGACTGCAGAGTACCAGCTCAGAAGTCCTACAGCTCCAGTGAGACCCTGA 139
QUERY: 419  AGGCCTATGACCATGACAGCAGGATGCACTATGGAAACCGAGTCACAGACCTCATCCACC 478

            | || |||| ||||||||  ||||||||||| ||||| |||||  |||||||  | |||

SBJCT: 140  AAGCATATGGCCATGACACGAGGATGCACTACGGAAATCGAGTTTCAGACCTGGTTCACA 199
QUERY: 479  GGGAGTCAGATGAGTTTCCTAGACAAGGAACCAACTTCACCCTTGCCGAACTGGGCATCT 538

            ||||||| ||||||||||| || |||||||| |||||||||||||| |||||||| ||||

SBJCT: 200  GGGAGTCGGATGAGTTTCCAAGGCAAGGAACGAACTTCACCCTTGCAGAACTGGGAATCT 259
QUERY: 539  GTGAGCCCTCCCCACACCGAAGCGGCTACTGCTCCGACATGGGGATCCTTCACCAGGGCT 598

            |||||||||| || || ||||| ||||||||||| ||||| || || || || || ||||

SBJCT: 260  GTGAGCCCTCTCCCCATCGAAGTGGCTACTGCTCGGACATAGGAATACTCCATCAAGGCT 319
QUERY: 599  ACTCCCTTAGCACAGGGTCTGACGCCGACTCCGACACCGAGGGAGGGATGTCTCCAGAAC 658

            | ||| | ||||| || ||||| || ||||| ||||| ||||| |||||||||||||| |

SBJCT: 320  ATTCCTTGAGCACTGGCTCTGATGCTGACTCAGACACGGAGGGCGGGATGTCTCCAGAGC 379
QUERY: 659  ACGCCATCAGACTGTGGGGCAGAGGGATAAAATCCAGGCGCAGTTCCGGCCTGTCCAGTC 718

            |||| ||||| |||||||| |||||||| |||||||| || ||||| |||||||| ||||

SBJCT: 380  ACGCGATCAGGCTGTGGGGAAGAGGGATCAAATCCAGCCGAAGTTCTGGCCTGTCAAGTC 439
QUERY: 719  GTGAAAACTCGGCCCTTACCCTGACTGACTCTGACAACGAAAACAAATCAGATGA 773

            ||||||||||||| || || || |||||||| ||||| || ||||| ||||||||

SBJCT: 440  GTGAAAACTCGGCTCTCACGCTCACTGACTCCGACAATGAGAACAAGTCAGATGA 494
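
The nucleotide alignments above mark each identical position with a vertical bar between the QUERY and SBJCT rows. As a minimal sketch of that convention, the following Python recomputes the marker row and the identity count for one aligned row. The function names `match_line` and `count_identities` are illustrative only and are not part of any BLAST tool, and the treatment of `-` as a gap character that never counts as identical is an assumption.

```python
def match_line(query: str, subject: str) -> str:
    """Return a marker row with '|' at identical positions and ' ' elsewhere."""
    if len(query) != len(subject):
        raise ValueError("aligned rows must have the same length")
    # Assumption: '-' denotes a gap and is never scored as an identity.
    return "".join(
        "|" if q == s and q != "-" else " "
        for q, s in zip(query.upper(), subject.upper())
    )


def count_identities(query: str, subject: str) -> int:
    """Count the identical positions in one aligned row."""
    return match_line(query, subject).count("|")


# Final row of the alignment above (QUERY 719-773 against SBJCT 440-494).
query_row = "GTGAAAACTCGGCCCTTACCCTGACTGACTCTGACAACGAAAACAAATCAGATGA"
sbjct_row = "GTGAAAACTCGGCTCTCACGCTCACTGACTCCGACAATGAGAACAAGTCAGATGA"

print(match_line(query_row, sbjct_row))
print(count_identities(query_row, sbjct_row), "of", len(query_row))
```

Summing the per-row counts over every row of a segment should reproduce the IDENTITIES figure in that segment's header, which gives a simple check on transcription of the sequences.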

The full FCTR3a amino acid sequence also has 342 of 383 amino acid residues (89%)
identical to, and 342 of 383 residues (89%) positive with, the 2764 amino acid residue Odd
Oz/ten-m homolog 2 (Drosophila) (GenBank Acc: NP_035986.2) (SEQ ID NO:68) (Table 3P).
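
The percentages quoted with each identity and positive count follow from the ratio of matching positions to aligned length. The sketch below assumes the percentage is truncated to a whole number rather than rounded, which is consistent with the figures in this section (for example, 540/628 is reported as 85%). The function name `blast_percent` is hypothetical.

```python
def blast_percent(matches: int, aligned_length: int) -> int:
    """Whole-number percentage of matching positions (truncated, by assumption)."""
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    return (100 * matches) // aligned_length


# Counts quoted above for FCTR3a against Odd Oz/ten-m homolog 2.
print(blast_percent(342, 383))  # identities
print(blast_percent(342, 383))  # positives
```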

Table 3P. BLASTP of FCTR3a against Odd Oz/ten-m homolog 2 - (SEQ ID NO:68) 

>GI|7657415|REF|NP_035986.2| ODD OZ/TEN-M HOMOLOG 2 (DROSOPHILA); ODD OZ/TEN-M HOMOLOG 3
(DROSOPHILA) [MUS MUSCULUS]
GI|4760778|DBJ|BAA77397.1| (AB025411) TEN-M2 [MUS MUSCULUS]

LENGTH = 2764






SCORE = 495 BITS (1274), EXPECT = E-139 

IDENTITIES = 342/383 (89%), POSITIVES = 342/383 (89%), GAPS = 41/383 (10%) 



QUERY: 


37 


SBJCT : 


189 


QUERY : 


97 


SBJCT : 


249 


QUERY : 


157 


SBJCT : 


309 


QUERY : 


217 


SBJCT : 


369 


QUERY : 


244 


SBJCT : 


429 


QUERY : 


296 


SBJCT : 


489 


QUERY : 


356 


SBJCT : 


549 



MMMMMMMMMMMMMMMMMMMMMMMMMMMMMI 

HNPPPVSCQMPLLDSNTSHQIMDTNPDEEFSPNSYLLRACSGPQQASSSGPPNHHSQST3 
RPPLPPPHNHTLSHHHSSANSLNRNSLTNRRSQIHAPAPAPNDLATTPESVQLQDSWVLl 

M M M M M M M M M M M M M I M M M M M M M M M I M M M ! M M M 

RPPLPPPHNHTLSHHHSSANSLNRNSLTNRRSQIHAPAPAPNDLATTPESVQLQDSWVL] 
SNVPLETRHFLFKTSSGSTPLFSSSSPGYPLTSGTVYTPPPRLLPRNTFSRKAFKLKKPI 

30 ~ M M M M M M M M M M M M M I M M M M M M M M M M M M M M M M 

/PLETRHFLFKTSSGSTPLFSSSSPGYPLTSGTVYTPPPRLLPRNTFSRKAFKLKKP; 
"SWKCAALSAIAAALLLAILLAYFI 

MMMMMMMMMMMMI 

^SWKCAALSAIAAALLLAILIjAYFIAMHLLGLNWQLQPADGHTFNNGTOTGLPGNDD' 
VPWSLKNSSIDSGEAEVGRRVTQEVPPGVFWRSQIHISQPQFLKFNISLGK 

IIIIIIIIIIIIIIIMIIIMIIIIMIIIIMMIIIMIIIMIIIII 

i/PSGGKVPWSLKNSSIDSGEAEVGRRVTQEVPPGVFWRSQIHISQPQFLKFNISLGK 
FGVYIRRGLPPSHAQYDFMERLDGKEKWSWESPRERRSIQTLVQNEAVFVQYLDVG: 

IIIMIIIIIIIIIIMIIMIIMIMMIIIIIIIIIIIMIIIIIMIMIMI 

FGVYIRRGLPPSHAQYDFMERLDGKEKWSWESPRERRSIQTLVQNEAVFVQYLDVG 
LAFYNDGKDKEMVS FNTWLD 378 

MMMMMMMMMMI 



243 



The full FCTR3b amino acid sequence has 2442 of 2802 amino acid residues (87%)
identical to, and 2532 of 2802 residues (90%) positive with, the 2802 amino acid residue
teneurin-2 [Gallus gallus] (GenBank Acc: AJ279031) (SEQ ID NO:69) (Table 3Q).

Table 3Q. BLASTP of FCTR3b against Teneurin-2 - (SEQ ID NO:69)



>GI|10241574|EMB|CAC09416.1| (AJ279031) TENEURIN-2 [GALLUS GALLUS]
LENGTH = 2802

SCORE = 4853 BITS (12589), EXPECT = 0.0 

IDENTITIES = 2510/2802 (87%), POSITIVES = 2600/2802 (90%), GAPS = 69/2802 (2%) 






MDVKDRRHRSLTRGRCGKECRYTSSSLDSEDCRVPTQKSYSSSETLKAYDHDSRMHYGNR 60

IMIIIMIMIIIMMIIIIMMIMIMM MMIIIIIMII IMIIIMI 

MDIKDRRHRSLTRGRCGKECRYTSSSLDSEDCRVPAQKSYSSSETLKAYGHDTRMHYGNR 60
VTDLIHRESDEFPRQGTNFTLAELGICEPSPHRSGYCSDMGILHQGYSLSTGSDADSDTE 120 

| + || + IIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIMIIillllllllllllllll 

VSDLVHRESDEFPRQGTNFTLAELGICEPSPHRSGYCSDIGILHQGYSLSTGSDADSDTE 120 
GGMSPEHAIRLWGRGIKSRRSSGLSSRENSALTLTDSDNENKSDDENG 168 

10 ~ I { 1 1 1 1 1 1 1 1 1 1 1 [ 1 1 1 1 I II I III II Mill 1 1 II II Ml M II 






■IV PWSLKNSSIDSGEAE 411 

30 " Ml II +111 Mill I 






QUERY : 


1 


fir* t nm 

SB JCT : 


1 


QUERY : 


61 


SBJC1 : 




QUERY : 


121 


brJUCi : 


m 
±Z 1 


QUERY : 


lfey 


SB JCT : 


181 


QUERY : 


213 


SB JCT : 


241 


QUERY : 


273 


rrn -T/-trn _ 

SBJC1 : 


3 01 


QUERY : 


333 


SB JCT : 


361 


QUERY : 


393 


SB JCT : 


421 


QUERY : 


412 


fin -T <— irn 

SBJCT : 


4 81 


QUERY : 


472 


SBJCT : 


541 


QUERY : 


532 


SBJCT : 


601 


QUERY : 


592 


SBJCT : 


661 


QUERY : 


652 




721 


QUERY : 


712 


SBJCT : 


781 


QUERY: 


772 


SBJCT : 


84 1 


QUERY : 


83 2 


CO T/^T 1 . 

bBJLi : 


901 


QUERY : 


: 892 


SBJCT : 


: 961 


QUERY: 


: 952 



MINIM MIMMIMIMIMI 

)QLAETKHSLIRRPIPPTSSSSLLPSAQLPSSHNPPP^ 
IFSPNSYLLRACSGPQQASSSGPPNHHSQSTLRPPLP3 

1 1 1 1 1 1 1 1 1 1 1 1 f 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 

JFSPNSYLLRACSGPQQASSSGPSNHHSQSTLRPPLP] 
[XXXQIHAPAPAPNDLATTPESVQLQDSWVLNSNVPL] 

MIIMMIIIIIIIMMMIIMMIIMII 

IRRNQIHAPAPAPNDLATTPESVQLQDSWVLNSNVPL] 
r PLTSGTVYTPPPRLLPRNTFSRKAFKLKKPSKYCSW] 

IIMMIIIIIIIIIIMMM IIMIIilllMI 



IMMMMMMMIMMMMMMMMMMMMIMMMMMMMMI 

3RKVTQEVPPGVFWRSQIHISQPQFLKFNISLGKDALFGVYIRRGLPPSHAQYDFMERL 
3KEKWSWESPRERRSIQTLVQNEAVFVQYLDVGLWHLAFYNDGKDKEMVSFNTWLDS 

M 1 1 1 II 1 1 II II 1 1 1 M II M M II II II 1 1 II M II I II II M II Ml i + l M II 

^KEKWSWES PRERRS IQTLVQNEAVFVQYLDVGLWHLAFYNDGKDKE WSFSTVILDS 
QDCPRNCHGNGECVSGVCHCFPGFLGADCAKAACPVLCSGNGQYSKGTCQCYSGWKGAE 

M I M I M M M M M II 1 1 M II I M M I II II I II M li II II II I II M I M I 

QDCPRNCHGNGECVSGVCHCFPGFHGADCAKAACPVLCSGNGQYSKGTCLCYSGWKGPE 
DVPMNQCIDPSCGGHGSCIDGNCVCSAGYKGEHCEEVDCLDPTCSSHGVCVNGECLCSP 

MI+-MIIIMIMMMMMMI M II M I II M I II II hi 1 1 II II II II II 

DVPISQCIDPSCGGHGSCIEGNCVCSIGYKGENCEEVDCLDPTCSNHGVCVNGECLCSP 
WGGLNCEIARVQCPDQCSGHGTYLPDTGLCSCDPNWMGPDCSVEVCSVDCGTHGVCIGG 

50 " MINIM I MIMMMMM 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

^JGGINCELPRAQCPDQCSGHGTYLSDTGLCSCDPNWMGPDCSVEVCSVDCGTHGVCIGG 
:RCEEGWTGAACDQRVCHPRCIEHGTCKDGKCECREGWNGEHCTIGRQTAGTETDGCPD 

IMIMIM i M 11 1 It I) I IMIIMIIMIMIMMMIIMM IIMIMM 

^RCEEGWTGVACDQRVCHPRCTEHGTCKDGKCECREGWNGEHCTIGRQTTGTETDGCPD 
CNGNGRCTLGQNSWQCVCQTGWRGPGCNVAMETSCADNKDNEGDGLVDCLDPDCCLQSA 

M II I II I II II II 1 1 II II II II II M 1 1 II I M M M I II 1 1 II M 1 1 II M II I 

CNGNGRCTLGQNSWQCVCQTGWRGPGCNVAMETSCADNKDNEGDGLVDCLVPDCCLQST 
3NSLLCRGSRDPLDIIQQGQTDWPAVKSFYDRIKLLAGKDSTHIIPGENPFNSSLVSLI 

II M I II II II M II II I + MIMMMMM I II I M I M II II 1 1 II II II I 

QNSLLCRGSRDPLDI IQQSHSGSPAVKSFYDRIKLLVGKDSTHI IPGENPFNSSLVSLI 
3QWTTDGTPLVGVNVSFVKYPKYGYTITRQDGTFDLIANGGASLTLHFERAPFMSQER 

IIIMIIMMIMMIIMMIMIMIMM MMIII + IIIIIMIIIIMIII 



70 I U 1 1 1 1 1 1 M 1 1 1 1 f 1 1 M 1 1 1 E I ! 1 1 M 1 1 II 1 1 1 i I M t M 1 1 ! t MMIIIM 




SBJCT: 1021 TVWLPWNSFYAMDTLVMKTEENSIPSCDLSGFVRPDPVI ISSPLSTFFSDAPGRNPIVPE 1080 






QUERY: 1012 TQVLHEEIELPGSNVKLRYLSSRTAGYKSLLKXTMTQSTVPLNLIRVHLMVAVEGHLFQK 1071 

IMMMII + Mi++M iiillliiillllll MM 1 1 1 1 1 l + M M II M M II M 

SBJCT: 1081 TQVLHEEIEVPGSSIKLIYLSSRTAGYKSLLKIIMTQSLVPLNLIKVHLMVAVEGHLFQK 1140 
QUERY * 1072 SFQASPNLASTFIWDKTDAYGQRVYGLSDAWSVGFEYETCPSLILWEKRTALLQGFELD 1131 

M MMM M M M M M M + M I M M M M M M M M M M M M M M U M M 

SBJCT: 1141 SFLASPNLAYTFIWDKTDAYGQKVYGLSDAWSVGFEYETCPSLILWEKRTALLQGFELD 1200 
QUERY: 1132 PSNLGGWSLDKHHILNVKSGILHKGTGENQFLTQQPAIITSIMGNGRRRSISCPSCNGLA 1191 

M M I M M M M + M M M M M ! M M M M M l + l i M M I M M M M M M i M 

SBJCT: 1201 PSNLGGWSLDKHHVLNVKSGILHKGNGENQFLTQQPAVITSIMGNGRRRSISCPSCNGLA 1260



15 QUERY: 1192 EGNKLLAPVALAVGIDGSLYVGDFNYIRRIFPSRNVTSILELRNKEFKHSNNPAHKYYLA 1251 

MMMMMMMMMI + MMMMMMMMMMMIMMMMMMMMI 

SBJCT: 1261 EGNKLI^PVALAVGIDGSLFVGDFNYIRRIFPSRNVTSILELRNKEFKHSNNPAHKYYIA 1320 
QUERY: 1252 VDPVSGSLYVSDTNSRRIYRVKSLSGTKDLAGNSEWAGTGEQCLPFDEARCGDGGKAID 1311 

20 MIMMMMMMMM + MM + MIMMMMMMIMMMMMMMMi + l 

SBJCT: 1321 VDPVSGSLYVSDTKSRRIYKVKSLTGTKDLAGNSEWAGTGEQCLPFDEARCGDGGKAVD 1380 
QUERY: 1312 ATLMSPRGIAVDKNGLMYFVDATMIRKVDQNGIISTLLGSNDLTAVRPLSCDSSMDVAQV 1371

MMMIMMM MMMMMMMMIMMMMMMMMMMMMM + M 

SBJCT: 1381 ATLMSPRGIAVDKYGLMYFVDATMIRKVDQNGIISTLLGSNDLTAVRPLSCDSSMDVSQV 1440
QUERY * 1372 RLEWPTDLAVNPMDNSLYVLENNVILRITENHQVSIIAGRPMHCQVPGIDYSLSKXXXXX 1431 

MMIMMi + MMMMMMMMMMIMMIMMMMMMMMM 

SBJCT: 1441 RLEWPTDIAVDPMDNSLYVLENNVILRITENHQVSIIAGRPMHCQVPGIDYSLSKLAIHS 1500 



QUERY: 1432 XXXXXXXXXXXXTGVLYITETDEKKINRLRQVTTNGEICLLAGAASXXXXXXXXXXXXYS 1491 

IIIIM + MMMMMMMMMMIIIMM 1 1 

SBJCT: 1501 ALESASAIAISHTGVLYISETDEKKINRLRQVTTNGEICLLAGAASDCDCKNDVNCNCYS 1560 



QUERY: 1492 GDDAYATDAILNSPSSLAVAPDGTIYIADLGNIRIRAVSKNKPVLNAFNQYEAASPGEQE 1551

Mi M M M M M M M M M M M M M M M M M M l + l + M + M M M M M M I 

SBJCT: 1561 GDDGYATDAILNSPSSLAVAPDGTIYIADLGNIRIRAVSKNRPILNSFNQYEAASPGEQE 1620
QUERY* 1552 LYVFNADGIHQYTVSLVTGEYLYNFTYSTDNDVTELIDNNGNSLKIRRDSSGMPRHLLMP 1611 

40 MMMMMMI + MMMMMMM + MMM++I + MMM + MI + MMMMM 

SBJCT: 1621 LYVFNADGIHQYTLSLVTGEYLYNFTYSSDNDVTEVMDSNGNSLKVRRDASGMPRHLLMP 1680 
QUERY: 1612 DNQIITLTVGTNGGLKWSTQNLELGLMTYDGNTGLLATKSDETGWTTFYDYDHEGRLTN 1671 

MM + M MMMM + MM M M M M + M + M M M M M M M M M M M M M 

45 SBJCT: 1681 DNQIVTLAVGTNGGLKLVSTQTLELGLMTYNGNSGLLATKSDETGWTTFYDYDHEGRLTN 1740 
QUERY: 1672 VTRPTGWTSLHREMEKSITIDIENSNRDDDVTVITNLSSVEASYTWQDQVRNSYQLCN 1731 

MMM MM MM MM MM MMM MM MMMMM MM MM MM MMM 

SBJCT: 1741 VTRPTGWTSLHREMEKSITIDIENSNRDDDVTVITNLSSVEASYTWQDQVRNSYQLCN 1800 

50 

QUERY: 1732 NGTLRVMYANGMGISFHSEPHVLAGTITPTIGRCNISLPME3STGLNSIEWRLRKEQIKGKV 1791 

MMMMMM MMMMMMI + MMMIMMIMMMMMMMMMMI 

SBJCT: 1801 NGTLRVMYANGMSISFHSEPHVLAGTVTPTIGRCNISLPMENGLNSIEWRLRKEQIKGKV 1860
QUERY: 1792 TIFGRKLRVHGRNLLSIDYDRNIRTEKIYDDHRKFTLRIIYDQVGRPFLWLPSSGLAAVN 1851

HMMMMMMMMMMMMMMI M M M M M l + M M M M M M M M 

SBJCT: 1861 TVFGRKLRVHGRNLLSIDYDRNIRTEKIYDDHRKFTLRIIYDQLGRPFLWLPSSGLAAVN 1920 
QUERY: 1852 VSYFFNGRLAGLQRGAMSERTDIDKQGRIVSRMFADGKVWSYSYLDKSMVLLLQSQRQYI 1911 

60 M M M M M M M M M M M M M M l + M M M I M M l + M + M M M M M M M 

SBJCT: 1921 VSYFFNGRLAGLQRGAMSERTDIDKQGRIISRMFADGKVWSYTYLEKSMVLLLQSQRQYI 1980 
QUERY: 1912 FEYDSSDRLLAVTMPSVARHSMSTHTSIGYIRNIYNPPESNASVIFDYSDDGRILKTSFL 1971 

MIMIIII IMMMMIMIMM + IIIIIIMMIIIIIMMMIIMMMIM 

65 SBJCT: 1981 FEYDSSDRLHAVTMPSVARHSMSTHTSVGYIRNIYNPPESNASVIFDYSDDGRILKTSFL 2040 
QUERY: 1972 GTGRQVFYKYGKLSKLSEIVYDSTAVTFGYDETTGVLKMVNLQSGGFSCTIRYRKXGPLV 2031 

M f M i I M 11 1 1 1 1 It M It 1 1 1 1 1 i 1 1 1 1 1 1 1 M 1 1 1 M i f 1 1 1 1 1 1 M 1 1 1 1 ! ! ! M 

SBJCT: 2041 GTGRQVFYKYGKLSKLSEIVYDSTAVTFGYDETTGVLKMVNLQSGGFSCTIRYRKIGPLV 2100 



QUERY: 2032 DKQIYRFSEEGMVNARFDYTYHDNSFRIASIKPVISETPLPVDLYRYDEISGKVEHFGKF 2091

MIIIMilMlllillMIIIMMIiltiil-HMIIIMMIIIiillllMIMM 

SBJCT: 2101 DKQIYRFSEEGMVNARFDYTYHDNSFRIASIKPIISETPLPVDLYRYDEISGKVEHFGKF 2160
QUERY: 2092 GVIYYDINQIITTAVMTLSKHFDTHGRIKEVQYEMFRSLMYWMTVQYDSMGRVIKRELKL 2151

MIIMMIMIIMIIIMMMMIillllllMMIMMIIIIIIIMI 1 1 1 1 1 1 

SBJCT: 2161 GVIYYDINQIITTAVMTLSKHFDTHGRIKEVQYEMFRSLMYWMTVQYDSMGRVTKRELKL 2220
QUERY: 2152 GPYANTTKYTYDYDGDGQLQSVAVNDRPTWRYSYDXXXXXXXXXXXXSVRLMPLRYDLRD 2211 

10 ]| mil Mi Ml it I MINI Mi I MM li ill M I II I M I M i I 

SBJCT: 2221 GPYANTTKYTYDYDGDGQLQSVAVNDRPTWRYSYDLNGNLHLLNPGNSVRLMPLRYDLRD 2280
QUERY: 2212 RITRLGDVQYKIDDDGYLCQRGSDIFEYNSKGLLTRAYNKASGWSVQYRYDGVGRRASYK 2271

MMMk 1 1 1 1 1 1 M M M I M M H 1 1 1 1 1 i 1 1 1 1 M l + M i I M l + l 1 1 1 1 I 

SBJCT: 2281 RITRLGDIPYKIDDDGFLCQRGSDVFEYNSKGLLTRAYNKANGWNVQYRYDGLGRRASCK 2340
QUERY: 2272 TNLGHHLQYFYSDLHNPTRITHVYNHSNSEITSLYYDLQGHLFAMESSSGEEYYVASDNT 2331 

Mill i Ml IMMMIMM MM III MM MM I II MM III I Ml III Mill 

SBJCT: 2341 TNLGHHLQYFYADLHNPTRVTHVYNHSNSEITSLYYDLQGHLFAMESSSGEEYYVASDNT 2400 






QUERY: 2332 GTPLAVFSINGLMIKQLQYTAYGEIYYDSNPDFQMVIGFHGGLYDPLTKLVHFTQRDYDV 2391

MIIMIIIIIIMMIIIIIMIIIIIIIMIkMIIMIMIIIIIIMMIIIIII 

SBJCT: 2401 GTPLAVFSINGLMIKQLQYTAYGEIYYDSNPDFQLVIGFHGGLYDPLTKLVHFTQRDYDV 2460



25 QUERY- 2392 LAGRWTSPDYTMWKOTGKEPAPFNLYMFKSNNPLSSELDLKNYVTDVKSWLVMFGFQLS^ 2451 

IIIIMIIMIMIM + MIIMillMIIIIIMIIMilllllilMIMIIIIIi 

SBJCT: 2461 LAGRWTSPDYTMWKNIGREPAPFNLYMFKSNNPLSNELDLKNYVTDVKSWLVMFGFQLSN 2520 
QUERY: 2452 IIPGFPRAKMYFVPPPYELSESQASENGQLITGVQQTTERHNQAFMALEGQVITKKLHAS 2511 

30 MIMMMMM IIIIMill MiillllllililiMIIIIIIIIMi + MIII 

SBJCT: 2521 IIPGFPRAKMYFVSPPYELTESQACENGQLITGVQQTTERHNQAFMALEGQVISKRLHAS 2580 

QUERY: 2512 IREKAGHWFATTTPIIGKGIMFAIKEGRVTTGVSSIASEDSRKVASVLNNAYYLDKMHYS 2571

I I I II I II II l + l II M I I M I l+t + i I I II l + ll I l ++ l II l + ll II l + l + l l + l I I II 

35 SBJCT: 2581 IREKAGHWFATSTPIIGKGIMFAVKKGRVTTGISSIATDDSRKIASVLNSAHYLEKMHYS 2640 

QUERY: 2572 IEGKDTHYFVKIGSADGDLVTLGTTIGRKVLESGVNVTVSQPTLLVNGRTRRFTNIEFQY 2631 

MMMIMMIMM Mill I II M Mi II II I II I II M 1 1 1 1 1 II II I II I 

SBJCT: 2641 IEGKDTHYFVKIGSADSDLVTLAMTSGRKVLDSGVNVTVSQPTLLINGRTRRFTNIEFQY 2700


QUERY: 2632 STLLLSIRYGLTPDTLDEEKARVLDQARQRALGTAWAKEQQKARDGREGSRLWTEGEKQQ 2691 

MM++IMIII MIIIIilllMllilMlMMilillllililMMMIIil 

SBJCT: 2701 STLLINIRYGLTADTLDEEKARVLDQARQRALGSAWAKEQQKARDGREGSRWTDGEKQQ 2760 
45 QUERY: 2692 LLSTGRVQGYEGYYVLPVEQYPELADSSSNIQFLRQNEMGKR 2733 

I I + M j E M I It i It I S I M ( 1 1 M E 11 M M f 1 1 1 M 1 1 i I 

SBJCT: 2761 LLNTGRVQGYEGYYVLPVEQYPELADSSSNIQFLRQNEMGKR 2802 

The FCTR3b, c, d, e, and f amino acid sequences have 1524 of 2352 amino acid residues
(64%) identical to, and 1881 of 2352 residues (79%) positive with, amino acid residues
429-2771, 93 of 157 residues (59%) identical to and 118 of 157 residues (74%) positive with
amino acid residues 1-155, and 59 of 152 residues (38%) identical to and 68 of 152 residues
(43%) positive with amino acid residues 211-361 of Ten-m4 [Mus musculus] (ptnr: GenBank Acc:
BAA77399.1) (SEQ ID NO:70) (Table 3R).
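
The paragraph above reports three separately aligned segments against Ten-m4. As an illustrative sketch only, the Python below tallies how many residues of the 2771-residue subject fall inside at least one of the quoted ranges and pools the identity counts across the segments. The `Hsp` record, the function names, and the pooled totals are assumptions introduced for this example and are not figures stated in the application.

```python
from typing import List, NamedTuple, Tuple


class Hsp(NamedTuple):
    identities: int   # identical residues in the segment
    aligned: int      # aligned length of the segment
    sbjct_start: int  # first subject residue (1-based, inclusive)
    sbjct_end: int    # last subject residue (inclusive)


# Values quoted in the paragraph above for Ten-m4 (subject length 2771).
HSPS = [
    Hsp(1524, 2352, 429, 2771),
    Hsp(93, 157, 1, 155),
    Hsp(59, 152, 211, 361),
]


def covered_residues(hsps: List[Hsp]) -> int:
    """Count subject residues covered by at least one segment."""
    covered, last_end = 0, 0
    for start, end in sorted((h.sbjct_start, h.sbjct_end) for h in hsps):
        start = max(start, last_end + 1)  # skip any part already counted
        if end >= start:
            covered += end - start + 1
            last_end = end
    return covered


def pooled_identities(hsps: List[Hsp]) -> Tuple[int, int]:
    """Sum identities and aligned lengths over all segments."""
    return sum(h.identities for h in hsps), sum(h.aligned for h in hsps)


print(covered_residues(HSPS), "of 2771 subject residues covered")
print("pooled identities: %d/%d" % pooled_identities(HSPS))
```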

Table 3R. BLASTP of FCTR3b, c, d, e, and f against Mus musculus Ten-m4 - (SEQ ID NO:70)



>GI|4760782|DBJ|BAA77399.1| (AB025413) TEN-M4 [MUS MUSCULUS]
LENGTH = 2771

SCORE = 3089 BITS (8008), EXPECT = 0.0 

IDENTITIES = 1524/2352 (64%), POSITIVES = 1881/2352 (79%), GAPS = 28/2352 (1%)
QUERY * 401 KNSSIDSGEAEVGRRVTQEVPPGVFWRSQIHISQPQFLKFNISLGKDALFGVYIRRGLPP 460 

++ | II +IIII Milk I I IlikMII II kl kMM 

SBJCT: 429 EDSFIDSGEIDVGRRASQKIPPGTFWRSQVFIDHPVHLKFNVSLGKAALVGIYGRKGLPP 488 
QUERY- 461 SHAQYDFMERLDGK E KWS WE S PRERRS I QTLVQNEAVFVQ YLDVGLWHLAFYND 515 

|| | + |kl IM+ Ik 1+ I -I Mill kMMMM 

SBJCT: 489 SHTQFDFVELLDGRRLLTQEARSLEGPQRQSRGPVPPSSHETGFIQYLDSGIWHLAFYWD 548
QUERY- 516 GKDKEMVSFNTWLDSVQDCPRNCHGNGECVSGVCHCFPGFLGADCAKAACPVLCSGNGQ 575 

|| + kill I ++M HI Ikllkkll I II I 1 1 1 1 II + klllMIIIII 

SBJCT: 54 9 GKESEWSFLTTAIESVDNCPSNCYGNGDCISGTCHCFLGFLGPDCGRASCPVLCSGNGQ 608 



15 QUERY • 576 YSKGTCQCYSGWKGAECDVPMNQCIDPSCGGHGSCIDGNCVCSAGYKGEHCEEVDCLDPT 635 

I II I klllllMIIII Mill +1 IkSI I kk Mill MIMkIM 

SBJCT: 609 YMKGRCLCHSGWKGAECDVPTNQCIDVACSSHGTCIMGTCICNPGYKGESCEEVDCMDPT 668 
QUERY: 63 6 CSSHGVCVNGECLCSPGWGGLNCELARVQCPDQCSGHGTYLPDTGLCSCDPNWMGPDCSV 695 

20 Ml MM Ml M MM Ml I I II M 1 1 1 kll II M kll kl I Ilk 

SBJCT: 669 CSSRGVCVRGECHCSVGWGGTNCETPRATCLDQCSGHGTFLPDTGLCNCDPSWTGHDCSI 72 8 
QUERY * 696 EVCSVDCGTHGVCIGGACRCEEGWTGAACDQRVCHPRCIEHGTCKDGKCECREGWNGEHC 755 

kk III lllkll Mlkll MMIM Mill MllkllMM Mill: 

25 SBJCT: 729 EICAADCGGHGVCVGGTCRCEDGWMGAACDQRACHPRCAEHGTCRDGKCECSPGWNGEHC 788 

********* 

QUERY * 756 TIGRQTAGTETDGCPDLCNGNGRCTLGQNSWQCVCQTGWRGPGCNVAMETSCADNKDNEG 815 

II +1 M I II II M M I I I MM MM ik +111 I I Mkl 

SBJCT: 789 TIAHYLDRWKEGCPGLCNGNGRCTLDLNGWHCVCQLGWRGTGCDTSMETGCGDGKDNDG 848 



QUERY : 816 DGLVDCLDPDCCLQSACQNSLLCRGSRDPLDI IQQGQT- - DWPAVKSFYDRI KLLAGKDS 873 

M I II kl I II I M I + II II ilililk I + MMIM kll 

SBJCT: 849 DGLVDCMDPDCCLQPLCHVNPLCLGSPDPLDIIQETQAPVSQQNLNPFYDRIKFLVGRDS 90 8 



35 QUERY - 874 THIIPGENPFNSSLVSLIRGQWTTDGTPLVGVNVS FVKYPKYGYTITRQDGTFDLIANG 933 

II II II II k + MMkkllllliMklk I + IMklllkMk II 

SBJCT: 909 THSIPGENPFDGGHACVIRGQVMTSDGTPLVGVNISFINNPLFGYTISRQDGSFDLVTNG 968 
QUERY- 934 GASLTLHFERAPFMSQERTVWLPWNSFYAMDTLVMKTEENSIPSCDLSGFVRPDPIIISS 993 

40 I |+ I MMIk+li killk k kklk III MMIM I lkk+ I 

SBJCT: 969 GISIILRFERAPFITQEHTLWLPWDRFFVMETIVMRHEENEIPSCDLSNFARPNPWSPS 1028 
QUERY* 994 PLSTFFSAAPGQNPIVPETQVLHEEIELPGSNVKLRYLSSRTAGYKSLLKITMTQSTVPL 1053 

lk+| k + MM! I I Ml + I ++I MIMI lllkkk+l kl 

45 SBJCT: 1029 PLTSFASSCAEKGPIVPEIQALQEEIVIAGCKMRLSYLSSRTPGYKSVLRISLTHPTIPF 1088 
QUERY: 1054 NLIRVHLMVAVEGHLFQKSFQASPNLASTFIWDKTDAYGQRVYGLSDAWSVGFEYETCP 1113 

II++MIIIMII Ikl I kkk MMIM I kkl kl MlkllkM 

SBJCT: 1089 NLMKVHLMVAVEGRLFRKWFAAAPDLSYYFIWDKTDVYNQiCVFGFSEAFVSVGYEYESCP 1148 



QUERY: 1114 SLILWEKRTALLQGFELDPSNLGGWSLDKHHILNVKSGILHKGTGENQFLTQQPAIITSI 1173 

MIIIIMklikkl I IMMIMM Ik+Miilll Milk+IM + 1 II 

SBJCT: 1149 DLILWEKRTAVLQGYEIDASKLGGWSLDKHHALNIQSGILHKGNGENQFVSQQPPVIGSI 120 8 



55 QUERY: 1174 MGNGRRRSISCPSCNGLAEGNKLLAPVALAVGIDGSLYVGDFNYIRRIFPSRNVTSILEL 1233 

I M M I M M M I II II kll M 1 1 1 1 M I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 llkllk 

SBJCT: 1209 MGNGRRRSISCPSCNGLADGNKLLAPVALTCGSDGSLYVGDFNYIRRIFPSGNVTNILEM 1268 
QUERY: 1234 RNKEFKHSNNPAHKYYLAVDPVSGSLWSDTNSRRIYRVKSLSGTKDLAGNSEWAGTGE 1293 

60 lll + l + ll ++l MIMM lklk+++IIMMk++lll + Ml MIMMIk 

SBJCT: 1269 RNKDFRHSHSPAHKYYIATDPMSGAVFLSDTNSRRVFKVKSTTWKDLVKNSEWAGTGD 1328 
QUERY: 1294 QCLPFDEARCGDGGKAIDATLMSPRGIAVDKNGLMYFVDATMIRKVDQNGIISTLLGSND 1353 

Illllk IIIIMM +111 +IMI III IkMM IMklllMIIIIIIIIII 

65 SBJCT: 1329 QCLPFDDTRCGDGGKATEATLTNPRGITVDKFGLIYFVDGTMIRRVDQNGIISTLLGSND 1388 
QUERY: 1354 LTAVRPLSCDSSMDVAQVRLEWPTDLAVNPMDNSLYVLENNVILRITENHQVSIIAGRPM 1413

|| + MMIM I+++IIIIIIIIIIMIIIIIIIII + IIM+MIIII MINI 

SBJCT: 13 89 LTSARPLSCDSVMEISQTOLEWPTDLAINPMDNSLYVLDNIWVLQISENHQVRIVAGRPM 144 8 
QUERY: 1414 HCQVPGID-YSLSKXXXXXXXXXXXXXXXXXTGVLYITETDEKKINRLRQVTTNGEICLL 1472

Illllili + Ml Mill IMMMM + IMM + IM 1 + 

SBJCT: 1449 HCQVPGIDHFLLSKVAIHATLESATALAVSHNGVLYIAETDEKKINRIRQVTTSGEISLV 1508 

5 QUERY: 1473 AGAASXXXXXXXXXXXXYSGDDAYATDAI LNS PSSLAVAPDGT I Y I ADLGNI RI RAVSKN 1532 

ill | +| | I I i I II I l + l I I I I I M + I + MIIIIIII + M 

SBJCT: 1509 AGAPSGCDCKNDANCDCFSGDDGYAKDAKLNTPSSLAVCADGELYVADLGNIRIRFIRKN 1568 

QUERY* 1533 KPVLNAFNQYEAAS PGEQELYVFNADGIHQYTVSLVTGEYLYNFTYSTDNDVTEL IDNNG 1592 

10 || i| | || + || + HII + I + I i M M M+MIIIM+ I 1 + 1 + I I I I 

SBJCT: 1569 KPFLNTQNMYELSSPIDQELYLFDTSGKHLYTQSLPTGDYLYNFTYTGDGDITHITDNNG 1628 

QUERY - 1593 NS LKIRRDS SGMPRHLLMPDNQ I ITLTVGTNGGLKWSTQNLELGLMTYDGNTGLLATKS 1652 

I + +IMI + III I++II 1+ + I + IM 1+ l + M M +111 M + IMMM 

15 SBJCT: 162 9 NMVNTORDSTGMPLWLWPDGQVYOTTMGTNSALRSVT^^ 1688 

QUERY- 1653 DETGWTTFYDYDHEGRLTNVTRPTGWTSLHREMEKSITIDIENSNRDDDVTVITNLSSV 1712 

+ | MIMI+II iilMII III l + l + + i+ + +1 I++ 1 1 1 1+ IMI + 
SBJCT * 1689 NENGWTTFYEYDSFGRLTNVTFPTGQVSSFRSDTDSSVHVQVETSSK-DDVTITTNLSAS 1747 

20 

QUERY * 1713 EASYTWQDQVRNSYQLCNNGTLRVMYANGMGISFHSEPHVLAGTITPTIGRCNISLPME 1772 

I M++iliilMI + +I + II++ Ml! ++ +IM + IIII+ H + I+ I++II++ 

SBJCT: 1748 GAFYTLLQDQVRNSYYIGADGSLRLLIJ^GMEVALQTEPHLIiAGTVNPTVGKRXT^LPID 1807 
25 QUERY: 1773 NGLNS IEWRLRKEQIKGKVTIFGRKLRVHGRNLLS IDYDRNTRTEKIYDDHRKFTLRI IY 1832 

MM +111 MM + I + II + IM + IIM IMM + I + II I II I M I M I II 1 1 ll + l 

SBJCT: 1808 NGLNLVEWRQRKEQARGQVTVFGRRLRVHNRNLLSLDFDRVTRTEKIYDDHRKFTLRILY 1867 
QUERY * 1833 DQVGRPFLWLPSSGIAAVNVSYFFNGRLAGLQRGAMSERTDIDKQGRIVSRMFADGKVWS 1892 

30 || Ml II III I ill + l I +M + IM MM + 1+ Ml M + MMI + II 

SBJCT: 1868 DQAGRPSLWSPSSRLNGVNVTYSPGGHIAGIQRGIMSERMEYDQAGRITSRIFADGKMWS 1927 

QUERY: 1893 YSYLDKSMVLLLQSQRQYIFEYDSSDRLLAVTMPSVARHSMSTHTSIGYIRNIYNPPESN 1952 

l + M + IMM I llllllll + l +111 +IIII + III ++ I l + M MM III I 
SBJCT: 1928 YTYLEKSMVLHLHSQRQYIFEFDKNDRLSSVTMPNVARQTLETIRSVGYYRNIYQPPEGN 1987

QUERY- 1953 ASVIFDYSDDGRILKTSFLGTGRQVFYKYGKLSKLSEIVYDSTAVTFGYDETTGVLKMVN 2012 

MM I+++M +1 I +IIIII + I IMMMM + I +11 + 1 l + l MM l + ll II 

SBJCT: 1988 ASVIQDFTEDGHLLHTFYLGTGRRVIYKYGKLSKLAETLYDTTKVSFTYDETAGMLKTVN 2047
QUERY: 2013 LQSGGFSCTIRYRKIGPLVDKQIYRFSEEGMVNARFDYTYHDNSFRIASIKPVISETPLP 2072 

11+ M + IMMI + MM + I + M + M + MMMIMM I IMM+ I++ ll + IMM 

SBJCT: 2048 LQNEGFTCTIRYRQIGPLIDRQIFRFTEEGMVNARFDYNY-DNSFRVTSMQAVINETPLP 2106 
QUERY: 2073 VDLYRYDEISGKVEHFGKFGVIYYDINQIITTAVMTLSKHFDTHGRIKEVQYEMFRSLMY 2132

+ MIIM++M1 I MMMMMMMMIMM +1111 + IMIMII + IIIIM 

SBJCT: 2107 IDLYRYDDVSGKTEQFGKFGVIYYDINQI ITTAVMTHTKHFDAYGRMKEVQYE IFRSLMY 2166 
QUERY: 2133 WMTVQYDSMGRVIKRELKLGPYANTTKYTYDYDGDGQLQSVAVNDRPTWRYSYDXXXXXX 2192 

50 lllllll + llll + l + lll + lllllll + l + l + ll IMM + I++M + I llllll 

SBJCT: 2167 WMTVQYDNMGRWKKELKVGPYANTTRYSYEYDADGQLQTVSINDKPLWRYSYDIjNGNLH 2226 
QUERY: 2193 XXXXXXSVRLMPLRYDLRDRITRLGDVQYKIDDDGYLCQRGSDIFEYNSKGLLTRAYNKA 2252 

I II IIIIMMMMIIIMII + I + II + I Ml l + IMM III +MI + I 

SBJCT: 2227 LLS PGNSARLTPLRYDLRDRI TRLGDVQYKMDEDGFLRQRGGDVFE YNSAGLL I KAYNRA 2286
QUERY: 2253 SGWSVQYRYDGVGRRASYKTNLGHHLQYFYSDLHNPTRITHVYNHSNSEITSLYYDLQGH 2312 

IMII + IMM + IM I I++ llll + ll + ll III++II + IIII + IIIIIIIIIMII 

SBJCT: 2287 SGWSVRYRYDGLGRRVSSKSSHSHHLQFFYADLTNPTKVTHLYNHSSSEITSLYYDLQGH 2346 




QUERY: 2313 LFAMESSSGEEYYVASDNTGTPLAVFSINGLMIKQLQYTAYGEIYYDSNPDFQMVIGFHG 2372 

Mill Ml + I + I + I II IMMMI IMIII+ Illllili I + M + II++II + M 

SBJCT: 2347 LFAMELSSGDEFYIACDNIGTPLAVFSGTGLMIKQILYTAYGEIYMDTNPNFQI IIGYHG 2406 
QUERY: 2373 GLYDPLTKLVHFTQRDYDVIAGRWTSPDYTMWKNVGKEP-APFNLYMFKSNNPLSSELDL 2431

IMMIIMM +IIIIIMIIIIMI+ +11 + M + MIM + IM + I+ 1 + 

SBJCT: 2407 GLYDPLTKLVHMGRRDYDVIAGRWTSPDHELWKRLSSNSIVPFHLYMFKNNNPISNSQDI 2466 
QUERY: 2432 KNYVTDVKSWLVMFGFQLSNI IPGFPRAKMYFVPPPYELSESQAS ENGQL I TGVQQ 2487 

70 | ++ ||| 111+ Mill I + IM + I+ + I Ml +1 +1 + 1 III 

SBJCT: 2467 KCFMTDVNSWLLTFGFQLHNVIPGYPKPDTDAMEPSYELVHTQMKTQEWDNSKSILGVQC 2526
QUERY * 2488 TTERHNQAFMALE GQVITKKLHASIREKAGHWFATTTPIIGKGIMFAIKEGRVT 2541 

++ + || + || III I +| II++ I 111+ M + MIM 

SBJCT: 2527 E VQKQLKAF VTLERFDQLYGS TITS CQQAPETKK FAS SGS I FGKGVKFALKDGRVT 2582 

QUERY* 2542 TGVS S IASEDSRKVASVLNNAYYLDKMHYS IEGKDTHYFVKIGSADGDLVTLGTT IGRKV 2601 

I + M + M I++I++IIIMI + MMMi i ++ IM II + U + 

SBJCT: 2583 TDIISVANEDGRRIAAILNNAHYLENLHFTIDGVDTHYFVKPGPSEGDLAILGLSGGRRT 2642
QUERY- 2602 LESGVNVTVSQPTLLVNGRTRRFTNIEFQYSTLLLSIRYGLTPDTLDEEKARVLDQARQR 2661 

IMIMIIM + M + M I l + Ml Mill 111+ Mil 

SBJCT: 2643 LENGVNVTVSQINTMLSGRTRRYTDIQLQYRALCLNTRYG TTVDEEKVRVLELARQR 2699 

QUERY: 2662 ALGTAWAKEQQKARDGREGSRLWTEGEKQQLLSTGRVQGYEGYYVLPVEQYPEIADSSSN 2721

h IM + IM+ l + l M I II + IIIII + MIIIIIM++I MIMM + II++I 

SBJCT: 2700 AVRQAWAREQQRLREGEEGLRAWTDGEKQQVLNTGRVQGYDGFFVTSVEQYPELSDSANN 2759 
QUERY: 2722 IQFLRQNEMGKR 2733 

20 | l + IMM + l 

SBJCT: 2760 IHFMRQSEMGRR 2771 






SCORE = 161 BITS (407), EXPECT = 2E-37

IDENTITIES = 93/157 (59%), POSITIVES = 118/157 (74%), GAPS = 4/157 (2%)
QUERY - 1 MDVKDRR-HRSLTRGRCGKECRYTSSSLDSEDCRVPTQKSYSSSETLKAYDHDSRMHYGN 59 

HM + I + +11 1 II I I llllll Ml + + I IMMMIIIIMI 1 + 1+ 11 + 

SBJCT: 1 MDVKERKPYRSLTRRR-DAERRYTSSSADSEEGKGP-QKSYSSSETLKAYDQDARLAYGS 58 

QUERY: 60 RVTDLIHRESDEFPRQGTNFTLAELGICEPS - PHRSGYCSDMGILHQGYSLSTGSDADSD 118

| | |++ +I++M I llllll 111+ I + II + I + 1 + 1 + I 111+ MM + 
SBJCT: 59 RVKDMVPQEAEEFCRTGTNFTLRELGLGEMTPPHGTLYRTDIGLPHCGYSMGASSDADLE 118 

QUERY: 119 TEGGMSPEHAI RLWGRG I KSRRS SGLSSRENS ALTLT 155 

35 + +MM +IIMI +1 III MM II MM 

SBJCT: 119 ADTVLS PEHP VRLWGRSTRSGRSS CLSSRANSNLTLT 155 






SCORE = 72.1 BITS (176), EXPECT = 8E-11 

IDENTITIES = 59/152 (38%), POSITIVES = 68/152 (43%), GAPS = 42/152 (27%)

QUERY: 285 PAPAPND- -LATTP ESVQLQDSWVLNSNVPLETR 316 

l + IM I 1+ I I M+I + MM + IMM 

SBJCT: 211 PSPAPTDHSLSGEPPAGSAQEPTHAQDNWLLNSNIPLETRNLGKQPFLGTLQDNLIEMDI 270 

QUERY: 317 HFLFKXXXXXXXXXXXXXXXYPLTSGTVYTPPPRLLPRNTFSRKAFK 363

IMII II I M MMMI lll + IMI II 

SBJCT: 271 LSASRHDGAYSDGHFLFK-PGGTSPLFCTTSPGYPLTSSTVYSPPPRPLPRSTFSRPAFN 329 
QUERY: 364 LKKPSKYCSWKCXXXXXXXXXXXXXXXXXYFI 395

50 1 1 1 1 1 1 1 1 + ! I i 11 + 

SBJCT: 330 L KKP S KYCNWKCAAL S A I L I SATLV I LLAY F V 361 
*FCTR3F DOES NOT CONTAIN THESE AMINO ACIDS 

The 997-2733 amino acid fragment of the FCTR3bcde and f proteins was also found to have
1695 of 1737 amino acid residues (97%) identical to, and 1695 of 1737 residues (97%)
positive with, the 1737 amino acid residue KIAA1127 protein [Homo sapiens]
(GenBank Acc:AB032953) (SEQ ID NO:71), shown in Table 3S.
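
The percentages quoted above follow directly from the residue counts. As a minimal sketch, assuming a summary field in the "IDENTITIES = matched/aligned (pct%)" form used in the tables here, the ratio can be recomputed as follows. The helper names `parse_blast_stat` and `percent` are hypothetical and are not part of any BLAST tool.

```python
import re


def parse_blast_stat(field):
    """Extract (matched, aligned) from a field such as 'IDENTITIES = 1695/1737 (97%)'.

    Hypothetical helper for illustration; it only looks for the 'N/M' ratio.
    """
    match = re.search(r"(\d+)\s*/\s*(\d+)", field)
    if match is None:
        raise ValueError(f"no 'matched/aligned' ratio found in: {field!r}")
    return int(match.group(1)), int(match.group(2))


def percent(matched, aligned):
    """Return matched/aligned as a percentage (float)."""
    if aligned <= 0:
        raise ValueError("aligned length must be positive")
    return 100.0 * matched / aligned


# Counts taken from the KIAA1127 comparison described in the text.
matched, aligned = parse_blast_stat("IDENTITIES = 1695/1737 (97%)")
pct = percent(matched, aligned)
print(f"{matched}/{aligned} = {pct:.2f}% (integer part: {int(pct)}%)")
```

The integer part of the computed value agrees with the 97% figure stated for this comparison.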

Table 3S. BLASTP of FCTR3b, c, d, e, and f against Homo sapiens KIAA1127 protein
(SEQ ID NO:71)

> GI|6329763|DBJ|BAA86441.1| (AB032953) KIAA1127 PROTEIN [HOMO SAPIENS]
LENGTH = 1737

SCORE = 3295 BITS (8545), EXPECT = 0.0

IDENTITIES = 1695/1737 (97%), POSITIVES = 1695/1737 (97%)
QUERY- 997 TFFSAAPGQNP I VPETQVLHEE I ELPGSNVKLRYLS SRTAGYKS LLKI TMTQSTVPLNLI 1056 

5 IIIIIIMIlllMMIIIllllMIMIIIMMIilMIMIMIMIMlMMIII 

SBJCT: 1 TFFSAAPGQNP IVPETQVLHEE I ELPGSNVKLRYLSSRTAGYKSLLKI TMTQSTVPLNLI 60 
QUERY * 1057 RVHLMVAVEGHLFQKSFQASPNLASTFIWDKTDAYGQRVYGLSDAWSVGFEYETCPSLI 1116 

1 1 1 1 1 1 1 1 1 1 i 1 1 1 M 1 E 1 1 1 1 1 1 MIIIIIIIIIMMIMIIIMIIIMIIIllll 

10 SBJCT: 61 RVHLMVAVEGHLFQKSFQAS PNLAYTF IWDKTDAYGQRVYGLSDAWSVGFEYETCPSLI 120 
QUERY * 1117 LWEKRTALLQGFELDPSNLGGWSLDKHHILNVKSGILHKGTGENQFLTQQPAI ITS IMGN 1176 

IIIIMMMIMIIMIIIIMIIMIIMMMIIIMIIIMMMIMMHIIII 

SBJCT: 121 LWEKRTALLQGFELDPSNLGGWSLDKHHILNVKSGILHKGTGENQFLTQQPAIITSIMGN 180 









QUERY * 1177 GRRRSISCPSCNGLAEGNKLLAPVALAVGIDGSLYVGDFNYIRRIFPSRNVTSILELRNK 1236 

MIMMIMIMiMMIMMIMIillllilllilMlllllMIMIIIIIIIMI 

SBJCT: 181 GRRRSISCPSCNGLAEGNKLLAPVALAVGIDGSLYVGDFNYIRRIFPSRNVTSILELRNK 240 



20 QUERY- 1237 EFKHSNNPAHKYYLAVDPVSGSLYVSDTNSRRIYRVKSLSGTKDLAGNSEWAGTGEQCL 1296 

I { 1 1 M 1 1 f 11 1 1 1 M I M M 1 1 M tt ! 1 1 1 M M 1 1 M 1 M 1 1 i M ] M 1 1 i I! 1 1 1 M 

SBJCT: 241 EFKHSNNPAHKYYLAVDPVSGSLYVSDTNSRRIYRVKSLSGTKDLAGNSEWAGTGEQCL 300 
QUERY * 1297 P FDEARCGDGGKAIDATLMS PRG IAVDKNGLMYFVDATMIRKVDQNGI I STLLGSNDLTA 1356 

25 * i M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M I II I II 1 1 1 1 1 1 1 1 1 M 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 

SBJCT: 301 P FDEARCGDGGKAIDATLMS PRG IAVDKNGLMYFVDATMIRKVDQNGI I STLLGSNDLTA 360 
QUERY* 1357 VRPLSCDSSMDVAQVRLEWPTDLAVNPMDNSLYVLENNVILRITENHQVS I IAGRPMHCQ 1416 

IIMIIIIIMIIIIMIIIIIIMMIIIIIIIMIIIIIMIIIIMIMMIIIIII 

30 SBJCT: 361 VRPLS CDS SMDVAQVRLEWPTDLAVNPMDNSLYVLENNVILR I TENHQVS I IAGRPMHCQ 420 
QUERY: 1417 VPGIDYSLS KXXXXXXXXXXXXXXXXXTGVLY ITETDEKKINRLRQVTTNGE I CLLAGAA 1476 

IIIIIIMM 1 1 1 1 1 1 1 1 M 1 1 1 1 i 1 1 1 E 1 1 1 1 1 1 1 1 1 1 1 1 1 f 

SBJCT: 421 VPGIDYSLSKLAIHSALESASAIAISHTGVLY ITETDEKKINRLRQVTTNGE I CLLAGAA 480 


QUERY: 1477 SXXXXXXXXXXXXYSGDDAYATDAI LNSPS SLAVAPDGT I Y I ADLGNI RI RAVSKNKPVL 1536

I NIIIIIIIIIIIMIIIMIIMIIIIIIIIIIIIMMMMIII 

SBJCT: 481 SDCDCKNDVNCNCYSGDDAYATDAILNS PS SLAVAPDGTIYI ADLGNI RIRAVSKNKPVL 540 
40 QUERY * 1537 NAFNQYEAASPGEQELYVFNADGIHQYTVSLVTGEYLYNFTYSTDNDVTELIDNNGNSLK 1596 

IIIIIIIIIIMIIIIIIMIIIIIIIIMIIIIMIIIIIlllllMllllllllilM 

SBJCT: 541 NAFNQYEAASPGEQELYVFNADGIHQYTVSLVTGEYLYNFTYSTDNDVTELIDNNGNSLK 600 
QUERY * 1597 IRRDSSGMPRHLLMPDNQ I ITLTVGTNGGLKWSTQNLELGLMTYDGNTGLLATKSDETG 1656 

45 1 1 1 1 1 1 1 M II II II 1 1 1 1 1 1 II 1 1 1 1 II 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 M 1 1 1 1 1 M 1 1 1 

SBJCT: 601 IRRDSSGMPRHLLMPDNQ I I TLTVGTNGGLKWSTQNLELGLMT YD GNTGLLATKSDETG 660 
QUERY: 1657 WTTFYDYDHEGRLTNVTRPTGWTSLHREMEKSITIDIENSNRDDDVTVITNLSSVEASY 1716 

1 1 1 } 1 1 1 1 1 1 1 1 1 1 1 1 f M 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 i 1 1 1 1 i I E 1 1 1 1 It I i 1 1 1 1 

SBJCT: 661 WTTFYDYDHEGRLTNVTRPTGWTSLHREMEKSITIDIENSNRDDDVTVITNLSSVEASY 720
QUERY: 1717 TWQDQVRNSYQLCNNGTLRVMYANGMGISFHSEPHVLAGTITPTIGRCNISLPMENGLN 1776

MIMIIIIIIIIMIIMMMIIIIMMIIIMIIIMMIIMMMMIMIIII 

SBJCT: 721 TWQDQVRNS YQLCNNGTLRVMYANGMGI S FHSEPHVLAGTITPTIGRCN I SLPMENGLN 780 



QUERY: 1777 S I EWRLRKEQ I KGKVT I FGRKLRVHGRNLLS I DYDRNI RTEKI YDDHRKFTLRI I YDQVG 1836 

IIIMMMIIMIIIIIIIIIIIMIIIIIIMIIIIIIIIMMIIMMMMIIII 

SBJCT: 781 SI EWRLRKEQ I KGKVT I FGRKLRVHGRNLLS I DYDRNI RTEKI YDDHRKFTLRI I YDQVG 840 



60 QUERY: 1837 RPFLWLPSSGLAAVNVSYFFNGRLAGLQRGAMSERTDIDKQGRIVSRMFADGKVWSYSYL 1896 

I1MIIMIIIIIIMIIIIIIIIIIIMIIMIIIIMIIIMIIIIIIIMIIIIMI 

SBJCT: 841 RPFLWLPSSGLAAVNVSYFFNGRLAGLQRGAMSERTDIDKQGRIVSRMFADGKVWSYSYL 900 

QUERY: 1897 DKSMVLLLQSQRQYIFEYDSSDRLLAVTMPSVARHSMSTHTSIGYIRNIYNPPESNASVI 1956 

65 | | 1 | | | | | | | | | | | || | || || M | | | | | | | | | | | | | | | | || | | || I II I I I I I I I I I I I I 

SBJCT: 901 DKSMVLLLQSQRQYIFEYDSSDRLLAVTMPSVARHSMSTHTSIGYIRNIYNPPESNASVI 960 

QUERY: 1957 FDYSDDGRILKTSFLGTGRQVFYKYGKLSKLSEIVYDSTAVTFGYDETTGVLKMVNLQSG 2016 

IIMMIillllllMIIIIIMIIMIIIMIIMMIIIIIIIMIIIMIIIIIIII 

70 SBJCT: 961 FDYSDDGRILKTSFLGTGRQVFYKYGKLSKLSEIVYDSTAVTFGYDETTGVLKMVNLQSG 1020 




QUERY: 2017 GFSCTIRYRKIGPLVDKQIYRFSEEGMVNARFDYTYHDNSFRIASIKPVISETPLPVDLY 2076 

IMIMIIIIIIIIIIMMMIIMIIIMIIMMIIIMMMIMIIIIMMIM 

SBJCT: 1021 GFSCTIRYRKIGPLVDKQIYRFSEEGMVNARFDYTYHDNSFRIASIKPVISETPLPVDLY 1080 


QUERY: 2077 RYDEISGKVEHFGKFGVIYYDINQIITTAVMTLSKHFDTHGRIKEVQYEMFRSLMYWMTV 2136 

MMIMMMIIIIIIIIMMIIIIIIIIIIMMIIMMMIMIMMIIMM! 

SBJCT: 1081 RYDE I SGKVEHFGKFGVI YYDINQI ITTAVMTLSKHFDTHGR I KEVQYEMFRSLMYWMTV 1140 
10 QUERY: 2137 QYDSMGRVIKRELKLGPYANTTKYTYDYDGDGQLQSVAVNDRPTWRYSYDXXXXXXXXXX 2196 

MIIIIIIIIIIIIIIMIIMIIIIIIMIMIMIIIIUIIIIIIII 

SBJCT : 1141 QYDSMGRVIKRELKLGPYANTTKYTYDYDGDGQLQSVAVNDRPTWRYSYDLNGNLHLLNP 1200 
QUERY * 2197 XXSVRLMPLRYDLRDRITRLGDVQYKIDDDGYLCQRGSDIFEYNSKGLLTRAYNKASGWS 2256 

15 IIIIIIIIIIIIIMIMIMIIIIIIMIMIMMIMIIIIIIIIIIIIIMIM 

SBJCT: 1201 GNSWLMPLRYDLRDRITRLGDVQYKIDDDGYLCQRGSDIFEYNSKGLLTRAYNKASGWS 1260 
QUERY* 2257 VQYRYDGVGRRAS YKTNLGHHLQYFYSDLHNPTRI THVYNHSNSE ITSLYYDLQGHLFAM 2316 

MIMIIIIIIIIIMIMlMlillMMMMMMIIIMIIMIMIIIMIIMI 

20 SBJCT: 1261 VQYRYDGVGRRASYKTNLGHHLQYFYSDLHNPTR I THVYNHSNSE ITSLYYDLQGHLFAM 1320 
QUERY- 2317 ESSSGEEYYVASDNTGTPLAVFSINGLMIKQLQYTAYGEIYYDSNPDFQMVIGFHGGLYD 2376 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 E I E 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 f 1 1 1 M 1 1 1 f 1 1 1 1 1 1 1 1 

SBJCT * 1321 ES SSGEE YYVASDNTGTPLAVFS INGLM IKQLQYTAYGE I YYDSNPDFQMVIGFHGGLYD 1380 


QUERY * 2377 PLTKLVHFTQRDYDVLAGRWTSPDYTMWKNVGKEPAPFNLYMFKSNNPLSSELDLKNYVT 2436 

MIMIMllllMMIIMMIIIMIINlMMIMMIIIIIIMMIiMllIM 

SBJCT: 1381 PLTKLVHFTQRDYDVLAGRWTSPDYTMWKNVGKEPAPFNLYMFKSNNPLSSELDLKNYVT 1440 
30 QUERY: 243 7 DVKS WLVMFGFQLSNI I PGFPRAKMYFVPPPYELSESQASENGQL I TGVQQTTERHNQAF 2496 

IIIIMIMIIIIIIIIIIMMIIIMIIIIMMlMIMMMIIIIIMMIIll! 

SBJCT: 1441 DVKSWLVMFGFQLSNIIPGFPRAKMYFVPPPYELSESQASENGQLITGVQQTTERHNQAF 1500 
QUERY: 2497 MALEGQVITKKLHASIREKAGHWFATTTPIIGKGIMFAIKEGRVTTGVSSIASEDSRKVA 2556 

35 MillllllllMIIIIIMUMMIIMIlMIMMMMMMIIIlllMMIII 

SBJCT: 1501 MALEGQVITKKLHASIREKAGHWFATTTPIIGKGIMFAIKEGRVTTGVSSIASEDSRKVA 1560 
QUERY: 2557 SVLNNAYYLDKMHYSIEGKDTHYFVKIGSADGDLVTLGTTIGRKVLESGVNVTVSQPTLL 2616 

1 1 1 1 1 1 1 f 1 1 1 ! I M i M 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M I i 1 1 ! 1 1 1 i 1 1 1 1 1 1 1 1 1 1 

40 SBJCT: 1561 SVLNNAYYLDKMHYSIEGKDTHYFVKIGSADGDLVTLGTTIGRKVLESGVNVTVSQPTLL 1620 
QUERY: 2617 VNGRTRRFTN I E FQYS TLLLS I RYGLTPDTLDE E KAR VLDQARQRALGTAWAKEQQKARD 2676 

> 1 : 1 1 1 M M 1 1 1 1 ! M 1 1 1 : 1 M I M M 1 1 ; M M I M ; M 1 1 1 M ! 1 : 1 M 1 1 1 1 1 ! I 

SBJCT: 1621 VNGRTRRFTNIEFQYSTLLLSIRYGLTPDTLDEEKAR VLDQARQRALGTAWAKEQQKARD 1680


QUERY: 2677 GREGSRLWTEGEKQQLLSTGRVQGYEGYYVLPVEQYPELADSSSNIQFLRQNEMGKR 2733 

llllllllllllllllllllllllllilMMIIIIllMllllillllMIIMII 

SBJCT: 1681 GREGSRLWTEGEKQQLLSTGRVQGYEGYYVLPVEQYPELADSSSNIQFLRQNEMGKR 1737

The amino acid sequences of the FCTR3bcde and f proteins were also found to have
2528 of 2774 amino acid residues (91%) identical to, and 2557 of 2774 residues (92%) positive
with, the 2765 amino acid residue protein neurestin alpha [Rattus norvegicus] (GenBank
Acc:AF086607) (SEQ ID NO:72), shown in Table 3T.
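
The identity and gap counts reported for an alignment are tallies over its aligned columns. The sketch below illustrates that tally under simple assumptions: both rows are already aligned to equal length and "-" marks a gap. The function name `alignment_counts` and the 12-column fragment are hypothetical and for illustration only. "Positives" are not computed, because they additionally depend on the scoring matrix used by the search.

```python
def alignment_counts(query, subject, gap="-"):
    """Return (identities, gaps, aligned_length) for two equal-length aligned rows.

    A column counts as a gap if either row has the gap character,
    and as an identity if both rows carry the same residue.
    """
    if len(query) != len(subject):
        raise ValueError("aligned rows must have the same length")
    identities = 0
    gaps = 0
    for q, s in zip(query, subject):
        if q == gap or s == gap:
            gaps += 1
        elif q == s:
            identities += 1
    return identities, gaps, len(query)


# Hypothetical 12-column alignment fragment, for illustration only.
query = "MDVKDRR-HRSL"
subject = "MDVKERKPYRSL"
identities, gaps, length = alignment_counts(query, subject)
print(f"IDENTITIES = {identities}/{length}, GAPS = {gaps}/{length}")
```

Applied to a full alignment, the same column-by-column count yields ratios of the kind quoted in the text, such as 2528/2774.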



Table 3T. BLASTP of FCTR3bcde and f against Rattus norvegicus Neurestin alpha (SEQ ID
NO:72)

> GI|9910320|REF|NP_064473.1| NEURESTIN ALPHA [RATTUS NORVEGICUS]
GI|5712201|GB|AAD47383.1|AF086607_1 (AF086607) NEURESTIN ALPHA [RATTUS NORVEGICUS]
LENGTH = 2765

SCORE = 4988 BITS (12938), EXPECT = 0.0

IDENTITIES = 2528/2774 (91%), POSITIVES = 2557/2774 (92%), GAPS = 50/2774 (1%)

QUERY: 1 MDVKDRRHRSLTRGRCGKECRYTSSSLDSEDCRVPTQKSYSSSETLKAYDHDSRMHYGNR 60

IIIIIIIIIIIIIIIIMIIMIIIIIIIMMIIIIIIMIMIMMIIIIIIIMI!

SBJCT: 1 MDVKDRRHRSLTRGRCGKECRYTSSSLDSEDCRVPTQKSYSSSETLKAYDHDSRMHYGNR 60
QUERY: 61 VTDLIHRESDEFPRQGTNFTLAELGICEPSPHRSGYCSDMGILHQGYSLSTGSDADSDTE 120

IIM + MIMM III IMiliMllllilllllllllMllllilMMIIIIMIII

SBJCT: 61 VTDLVHRESDEFSRQGAWFTLAELGICEPSPHRSGYCSDMGILHQGYSLSTGSDADSDTE 120
QUERY: 121 GGMSPEHAIRLWGRGIKSRRSSGLSSRENSALTLTXXXXXXXXXXXXGRXXXXXXXXXXX 180

MlliliMIIMMMIIIIIMMMlMIMI 1 1

SBJCT: 121 GGMSPEHAIRLWGRGIKSRRSSGLSSRENSALTLTDSDNENKSDDDNGRPIPPTSSSSLL 180
QUERY: 181 XXXXXXXXHNPPPVSCQMPLLDSNTSHQIMDTNPDEEFSPNSYLLRACXXXXXXXXXXXX 240

MMIIIMIIIIIMMIIIIIMIMIIIIIIIIIII!

SBJCT: 181 PSAQLPSSHNPPPVSCQMPLLDSNTSHQIMDTNPDEEFSPNSYLLRACSGPQQASSSGPP 240
QUERY: 241 NHHSQXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQ IHAPAPAPNDLATTPES VQ 300

Mill IIIIIIMMIllllllill

SBJCT: 241 NHHSQSTLRPPLPPPHNHTLSHHHSSANSLNRNSLTNRRSQIHAPAPAPNDLATTPESVQ 300
QUERY: 301 LQDS WLNSNVPLETRHFLFKXXXXXXXXXXXXXXXYPLTSGTVYTPPPRLLPRNTFSRK 360

IIIIIIIIIIIIIIIMIIII IIIIIIIIIMIMIIMMIIII

SBJCT: 301
QUERY: 361 ILKKPSKYCSWKCXXXXXXXXXXXXXXXXXYFI 395

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 III

SBJCT: 361
QUERY: 396

MMUMIMMMllMMMMMMMIIIIIilllMM

SBJCT: 421 LPGNDDVATVPSGGKVPWSLKNSSIDSGEAEVGRRVTQEVPPGVFWRSQIHISQPQFLK
QUERY: 440 ^JISLGKDALFGVYIRRGLPPSHAQYDFMERLDGKEKWSWESPRERRSIQTLVQNEAVF

IIIIIIIIIIIMMIIIIIMIIIIIIIIIIIMIIIMIMIIIIIIIIIIIIIIII

SBJCT: 481 JTISLGKDALFGVYIRRGLPPSHAQYDFMERLDGKEKWSWESPRERRSIQTLVQNEAVF
QUERY: 500 SYLDVGLWHLAFYNDGKDKEiWSFNTVVXiDSVQDCPRNCHGNGECVSGVCHCFPGFLGA

MMMMMMMMMMMMMMMIIMMMMMMMIMIilllilM

SBJCT: 541 QYLDVGLWHLAFYNDGKDKEMVSFNTWLDSVQDCPRNCHGNGECVSGLCHCFPGFLGA
QUERY: 560 CAKAACPVLCSGNGQYSKGTCQCYSGWKGAECDVPMNQCIDPSCGGHGSCIDGNCVCSA

S M 1 1 1 1 1 1 1 1 1 1 } I i 1 1 M i 1 S 1 1 1 1 1 1 1 1 M S 1 1 1 1 i 1 1 1 1 M 1 1 11 M i 1 1 1 1 l-^l

SBJCT: 601 CAKAACPVLCSGNGQYSKGTCQCYSGWKGAECDVPMNQCIDPSCGGHGSCIDGNCVCAA
QUERY: 620 YKGEHCEEVDCLDPTCSSHGVCVNGECLCSPGWGGLNCELARVQCPDQCSGHGTYLPDT

" Ml Ml I III III I III INI Ml I III III I III Mil III I III II 1 1 III I III 11 +

SBJCT: 661 SCLCSPGWGGLNCELARVQCPDQCSGHGTYLPDS
QUERY: 680 3VC IGGACRCEEGWTGAACDQRVCHPRC IEHGTC

IMMIIMIIIIMMIIIMIIIMMIIMI

SBJCT: 721 GVC IGGACRCEEGWTGAACDQRVCHPRC I EHGTC
QUERY: 740 IDGCPDLCNGNGRCTLGQNSWQCVCQTGWRGPGC

1 1 M 1 1 1 1 1 1 II II MMMMMMMMMMMMMMIMM

SBJCT: 781 ^ECREGWNGEHCT I DGCPDLCNGNGRCTLGQNS WQCVCQTGWRGPGC
QUERY: 800 STSCADNKDNEGDGLVDCLDPDCCLQSACQNSLLCRGSRDPLDIIQQGQTDWPAVK

I i i 1 1 1 i 1 1 M f 1 1 1 i 1 1 1 i i 1 1 1 1 1 1 1 1 1 i E 1 E 1 1 1 i ! M 1 1 1 1 i 1 1 1 1 1 M M 1

SBJCT: 832 ETSCADNKDNEGDGLVDCLDPDCCLQSACQNSLLCRGSRDPLDIIQQGQTDWPAVK
QUERY: 860 RIKLliAGKDSTHIIPGENPFNSSLVSLIRGQWTTDGTPLVGVKVSFVKYPKYGYT

II I II I II M II M ll + l I M I I II II I I I I I I II II II I I II M II M II I M II

SBJCT: 892 RIKLLAGKDSTHI I PGDNPFNSSLVSLIRGQWTTDGTPLVGVWVS FVKYPKYGYT
QUERY: 920 DGTFDLIANGGASLTLHFERAPFMSQERTVWLPWNSFYAMDTLViyiKTEENSIPSCD

lll!lllllllllll++llllillllll!+lllll MIIIIIIIIIIIIMMIIIMI

SBJCT: 952

*********

QUERY: 980

IMIMMMMiiMIIMI-H MIMMMMMMMMIMMMMMMI

SBJCT: 1012 jSGFVRPDPIIISSPLSTFFSASPAANPIVPETQVLHEEIELPGTNVKLRYLSSRTAGYK 1071
QUERY: 1040 S LLKI TMTQSTVPLNL I RVHLMVAVEGHLFQKS FQAS PNLASTF I WDKTDAYGQRVYGLS 1099

1 1 i I f f f f I f 1 1 1 1 ! I i t i i 1 1 1 1 1 1 1 f 1 i M f f 1 1 1 1 1 1 1 IMIIIillillMMM 

SBJCT- 1072 SLLKITMTQSTVPLNLI RVHLMVAVEGHLFQKS FQAS PNLAYTF I WDKTDAYGQRVYGLS 1131 


QUERY- 1100 DAWSVGFEYETCPSLILWEKRTALLQGFELDPSNLGGWSLDKHHILNVKSGILHKGTGE 1159 

1 1 1 1 1 1 ! 1 1 1 1 1 j 1 1 1 1 i 1 1 1 l 1 1 1 1 1 1 1 1 1 r 1 1 1 1 1 1 r 1 1 1 1 1 1 iiiiMii inn 

SBJCT: 1132 DAWSVGFEYETCPSLILWEKRTALLQGFELDPSNLGGWSLDKHHTLNVKSGILLKGTGE 1191 
10 QUERY * 1160 NQFLTQQPAI ITS IMGNGRRRS ISCPSCNGLAEGNKLLAPVALAVGIDGSLYVGDFNYIR 1219 

IIMMMIIMIIIUMMiniillMIIIIMIMMMMIIIIIMIIIMM 

SBJCT: 1192 NQFLTQQPAI ITS IMGNGRRRS IS CP SCNGLAEGNKLLAPVALAVG I DGSLFVGDFNYIR 1251 
QUERY • 1220 RIFPSRNVTSILELRNKEFKHSNNPAHKYYLAVDPVSGSLYVSDTNSRRIYRVKSLSGTK 1279 

15 * 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 + 1 1 1 1 1 1 1 1 1 M + l 1 1 1 1 1 1 M 1 1 1 1 1 1 1 i 1 1 M I 

SBJCT: 1252 RIFPSRNVTSILELRNKEFKHSNSPGHKYYLAVDPVTGSLYVSDTNSRRIYRVKSLSGAK 1311 
QUERY * 1280 DLAGNSEWAGTGEQCLPFDEARCGDGGKAIDATLMSPRGIAVDKNGLMYFVDATMIRKV 1339 

1 1 1 } 1 1 1 1 f 1 1 1 1 1 1 1 1 i 1 1 1 1 f J 1 1 1 1 f E ^ 1 1 i 1 1 i 1 1 1 1 i 1 i ! 1 1 1 1 1 1 1 1 1 E I i 1 1 1 

20 SBJCT: 1312 DLAGNSEWAGTGEQCLP FDEARCGDGGKAVDATLMSPRGI AVDKNGLMYFVDATM IRKV 1371 
QUERY: 1340 DQNGIISTLLGSNDLTAVRPLSCDSSMDVAQVRLEWPTDLAVNPMDNSLYVLENNVILRI 1399 

IIIIIIMIIIIIIIIIIIMMIIIIIIMMIIIIMIIIIMIIIIIIIIMiMM 

SBJCT * 13 72 DQNGIISTLLGSNDLTAVRPLSCDSSMDVAQVRLEWPTDLAVNPMDNSLYVLENNVILRI 1431 


QUERY • 1400 TENHQVSIIAGRPMHCQVPGIDYSLSKXXXXXXXXXXXXXXXXXTGVLYITETDEKKINR 1459 

MlllilMMIIMMIIIIIIIlll IIMIMNMIIIII 

SBJCT: 1432 TENHQVS I IAGRPMHCQVPGIDYSLSKLAIHSALESASAIAISHTGVLYITETDEKKINR 14 91 
30 QUERY * 1460 LRQVTTNGE I CLLAGAASXXXXXXXXXXXXYSGDDAYATDAILNS PSSLAVAPDGT I YIA 1519 

i i I ^ ; ■ MMMIIIillililllllMlllillil 

SBJCT: 1492 LRQVTTNGE I CLLAGAASDCDCKNDVNCICYSGDDAYATDAILNS PSSLAVAPDGT I YIA 1551 

QUERY: 1520 DLGNIRIRAVSKNKPVLNAFNQYEAASPGEQELYVFNADGIHQYTVSLVTGEYLYNFTYS 1579 

35 | | | | M | | | | | || | | | | | | | || | | | | | | M | II I I I I I I I I I I I I II I II I I I I I I ! I I I 

SBJCT: 1552 DLGNIRIRAVSKNKPVLNAFNQYEAASPGEQELYVFNADGIHQYTVSLVTGEYLYNFTYS 1611 

QUERY * 1580 TDNDVTELIDNNGNSLKIRRDSSGMPRHLLMPDNQIITLTVGTNGGLKWSTQNLELGLM 1639 

1 1 1 1 1 i M 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 MIIMI1III 

40 SBJCT: 1612 ADNDVTELIDNNGNSLKIRRDSSGMPRHLLMPDNQIITLTVGTNGGLKAVSTQNLELGLM 1671 
QUERY- 1640 TYDGNTGLLATKSDETGWTTFYD YDHEGRLTNVTRPTGWTSLHREMEKS I T I D I ENSNR 1699 

MIIIIIIIIIMIMIMIIMIMIIMIMIIIIIMIIMIIIIIIIMIIIIII 

SBJCT- 1672 TYDGNTGLLATKSDETGWTTFYDYDHEGRLTNVTRPTGWTSLHREMEKSITVDIENSNR 1731 


QUERY* 1700 DDD VT V I TNLSS VE AS YTWQDQ VRNS YQLCNNGTLRVM YANGMG I S FHS E PHVLAGT I T 1759 

Mllllilllilllillilillllllll!l + MMMIIIIIIMMIIMMMM 

SBJCT: 1732 DNDVTV I TNLS S VEAS YT WQDQVRNS YQLCSNGTLRVMYANGMGVS FHS E PHVLAGTLT 1791 
50 QUERY: 1760 PTIGRCNISLPMENGLNSIEWRLRKEQIKGKVTIFGRKLRVHGRNLLSIDYDRNIRTEKI 1819 

IIIIMIIMMMIIIIMIIIMMIIIIIIIMMIIIIIMIIMIIIIMIIIII 

SBJCT: 1792 PTIGRCNISLPMENGLNSIEWRLRKEQIKGKVTIFGRKLRVHGRNLLSIDYDRNIRTEKI 1851 

QUERY: 1820 YDDHRKFTLRIIYDQVGRPFLWLPSSGLAAVNVSYFFNGRLAGLQRGAMSERTDIDKQGR 1879 

55 | | | | | | | | | M || | | | | || | | | | | | || | | | | | | | | I II M I I I I I I I I I I I I I I i I I I I I 

SBJCT: 1852 YDDHRKFTLRIIYDQVGRPFLWLPSSGLAAVNVSYFFNGRLAGLQRGAMSERTDIDKQGR 1911 

QUERY: 1880 IVSRMFADGKVWSYSYLDKSMVLLLQSQRQYIFEYDSSDRLLAVTMPSVARHSMSTHTSI 193 9 

IIMIIIIIIIIIIIIIIIIIMMMIIMIIIMIIMI 1 1 1 1 1 1 1 1 1 II 1 1 1 1 i 1 1 

60 SBJCT: 1912 IVSRMFADGKVWSYSYLDKSMVLLLQSQRQYIFEYDSSDRLHAVTMPSVARHSMSTHTSI 1971 
QUERY: 1940 GY IRNI YNP PESNASVI FDYSDDGRI LKTSFLGTGRQVFYKYGKLSKLS E I VYDS TAVTF 1999 

IMIIIIIIIIIIIIIIMMMIIIMIMIIMMIMMIIIIIMIMIIIIIMI 

SBJCT: 1972 GYIRNIYNPPESNASVIFDYSDDGRILKTSFLGTGRQVFYKYGKLSKLSEIVYDSTAVTF 2031 


QUERY: 2000 GYDETTGVLKMVNLQSGGFS CT IRYRKIGPLVDKQ I YRFSEEGMVNARFDYTYHDNS FRI 2059 

IIIIIMilllilillllllllllllMllllllilllllMMIillllllMIIII 

SBJCT: 2032 GYDETTGVLKMVNLQSGGFSCTIRYRKVGPLVDKQIYRFSEEGMINARFDYTYHDNSFRI 2091 

QUERY: 2060 ASIKPVISETPLPVDLYRYDEISGKVEHFGKFGVIYYDINQI ITTAVMTLSKHFDTHGRI 2119

IMMIIMIllMlllMlillllMI

SBJCT: 2092 AS I KP VI S ETPLP VDL YR YDE I SGKVEHFGKFGVI YYD INQ 1 1 TTAVMTLSKHFDTHGR I 2151
QUERY: 2120 KEVQYEMFRSLMYWMTVQYDSMGRVIKRELKLGPYANTTKYTYDYDGDGQLQSVAVNDRP 2179

i 1 1 1 ] 1 M 1 II M 11 M M 1 M 1 1 M M i 1 il i i I 1! 1 ! 1 1 1 1 1 ! 1 i 1 1 1 t 1 I 1 t 1 ! 1 1 i

SBJCT: 2152 KEVQYEMFRSLMYWMTVQYDSMGRVIKRELKLGPYANTTKYTYDYDGDGQLQSVAVNDRP 2211
QUERY: 2180 TWRYSYDXXXXXXXXXXXXSVRLMPLRYDLRDRITRLGDVQYKIDDDGYLCQRGSDIFEY 2239

Hlllll j i 1 it 1 1 M t i 1 1 1 1 M ! 1 i 1 1 1 II ! 1 1 1 M M 1 1 II 1 1 1

SBJCT: 2212 TWRYSYDIjNGNLHLLNPGNSARLMPLRYDLRDRITRLGDVQYKIDDDGYLCQRGSDIFEY 2271
QUERY: 2240 NSKGLLTRAYNKASGWSVQYRYDGVGRRASYKTNLGHHLQYFYSDLHNPTRITHVYNHSN 2299

1 M M 1 1 1 1 1 1 II 1 1 1 M M i ! I t 1 ! 1 1 M 1 1 1 f 1 1 1 1 1 f 1 1 1 i 1 1 + 1 1 i M 1 i 1 1 I 1 i

SBJCT: 2272 NSKGLLTRAYNKASGWSVQYRYDGVSRRASYKTNLGHHLQYFYSDLHHPTRITHVYNHSN 2331
QUERY: 2300 SEITSLYYDLQGHLFAMESSSGEEYYVASDNTGTPLAVFSINGLMIKQLQYTAYGEIYYD 2359

1 1 M 1 1 1 1 1 ! 1 I ! M | M M 1 1 i 1 1 1 1 1 M 1 1 i 1 I II t 1 I 1 1 1 1 1 1 1 I 1 1 1 1 !! 1 1 1 1 1 1

SBJCT: 2332 SEITSLYYDLQGHLFAMESSSGEEYYVASDNTGTPLAVFSINGLMIKQLQYTAYGEIYYD 2391
QUERY: 2360 SNPDFQMVIGFHGGLYDPLTKLVHFTQRDYDVIAGRWTSPDYTMWKNVGKEPAPFNLYMF 2419

1 i i 11 1 II 1 1 M M M M M II 1 i 1 1 II M M 1 1 1 1 I 1 1 1 1 1 1 1 l + l 1 II 1 1 1 1 1 M M 1

SBJCT: 2392 SNPDFQMVIGFHGGLYDPLTKLVHFTQRDYDVLAGRWTSPDYTMWRNVGKEPAPFNLYMF 2451
QUERY: 2420 KSNNPLSSELDLKNYVTDVKSWLVMFGFQLSNIIPGFPRAKMYFVPPPYELSESQASENG 2479

UM 1 I Ul M M M M M M It 1 1 1 1 1 M 1 1 1 1 1 1 1 M II M II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1

SBJCT: 2452 KNWNPLSNELDLKNYVTDVKSWLVMFGFQLSNIIPGFPRAKMYFVPPPYELSESQASENG 2511
QUERY: 2480 QLITGVQQTTERHNQAFMALEGQVITKKLHASIREKAGHWFATTTPIIGKGIMFAIKEGR 2539

I I ! I g l i l l I ! I i M I i x i I ! | 1 1 Li 1 1 1 i 1 1 1 IE 1 1 If M I 1 I M 1 II 1 1 M 1 M II

SBJCT: 2512 QL ITGVQQTTERHNQAFLALEGQVIS KKLHAG IREKAGHWFATTTP 1 1 GKGIMFAIKEGR 2571
QUERY: 2540 VTTGVS S IAS EDSRKVAS VLNNAY YLDKMHYS I EGKDTHYF VK I GS ADGDLVTLGTT I GR 2599

i M 1 II 1 M 1 1 I 1 1 1 1 1 1 1 M M 1 i 1 1 1 1 I M 1 1 It M 1 M M 1 1 + 1 1 M M 1 I 1 i 1 t 1 1

SBJCT: 2572 VTTGVSSIASEDSRKVASVLNNAYYLDKMHYSIEGKDTHYFVKIGAADGDLVTLGTTIGR 2631
QUERY: 2600 KVLESGVNVTVSQPTLLVNGRTRRFTNIEFQYSTLLLSIRYGLTPDTLDEEKARVLDQAR 2659

IMMIIIIMMIIIIMlMIIMMIMIIlMIIMMMMMIIMllllMli

SBJCT: 2632 KVLESGWVTVSQPTLLVNGRTRRFTNIEFQYSTLLLSIRYGLTPDTLDEEKARVLDQAR 2691
QUERY: 2660 QRALGTAWAKEQQKARDGREGSRLWTEGEKQQLLSTGRVQGYEGYYVLPVEQYPELADSS 2719

M II II 1 1 1 II 1 1 1 1 1 1 1 1 II I M 1 1 1 1 M 1 II M 1 M M 1 1 1 1 1 II 1 1 1 1 1 M 1 1 1 1 1 1

SBJCT: 2692 QRALGTAWAKEQQKARDGREGSRLWTEGEKQQLLSTGRVQGYEGYYVLPVEQYPELADSS 2751
QUERY: 2720 SNIQFLRQNEMGKR 2733

IIIIMMIIIIil

SBJCT: 2752 SNIQFLRQNEMGKR 2765
* = FCTR3F DOES NOT CONTAIN THESE AMINO ACIDS

The amino acid sequences of the FCTR3bcde and f proteins were also found to have
2536 of 2774 amino acid residues (91%) identical to, and 2558 of 2774 residues (91%) positive
with, the 2764 amino acid residue protein Odd Oz/ten-m homolog 2 (Drosophila) [Mus musculus]
(GenBank Acc:NP_035986.2) (SEQ ID NO:65), shown in Table 3U.

Table 3U. BLASTP of FCTR3bcde and f against Odd Oz/ten-m homolog 2 (SEQ ID
NO:65)

> GI|7657415|REF|NP_035986.2| ODD OZ/TEN-M HOMOLOG 2 (DROSOPHILA); ODD OZ/TEN-M HOMOLOG 3
(DROSOPHILA) [MUS MUSCULUS]
GI|4760778|DBJ|BAA77397.1| (AB025411) TEN-M2 [MUS MUSCULUS]
LENGTH = 2764

SCORE = 4996 BITS (12961), EXPECT = 0.0

IDENTITIES = 2536/2774 (91%), POSITIVES = 2558/2774 (91%), GAPS = 51/2774 (1%)
QUERY* 1 MDVKDRRHRSLTRGRCGKECRYTSSSLDSEDCRVPTQKSYSSSETLKAYDHDSRMHYGNR 60 

MtlMIIMlMMIMlMIIIIMIMIIIIIMMMMlMMIIIIIIIMIM 

SBJCT: 1 MDVKDRRHRSLTRGRCGKECRYTSSSLDSEDCRVPTQKSYSSSETLKAYDHDSRMHYGNR 60 
QUERY: 61 VTDLIHRESDEFPRQGTNFTLAELGICEPSPHRSGYCSDMGILHQGYSLSTGSDADSDTE 120

MIMIIIMI MMMIIlllMMIIIMIMMIMMMIIIlMllMillli 

SBJCT: 61 VTDLVHRESDEFSRQGTNFTLAELGICEPSPHRSGYCSDMGILHQGYSLSTGSDADSDTE 120 
QUERY: 121 GGMSPEHAIRLWGRGIKSRRSSGLSSRENSALTLTXXXXXXXXXXXXGRXXXXXXXXXXX 180 

1 1 1 1 1 1 1 1 1 1 f I M 11 i 1 1 1 1 1 1 1 M 1 M I II t f I !l 

SBJCT: 121 GGMS PEHAIRLWGRGI KSRRSSGLS SRENS ALTLTDSDNENKSDDDNGRP I PPTSSS SLL 180 



15 QUERY - 181 XXXXXXXXHNPPPVSCQMPLLDSNTSHQIMDTNPDEEFSPNSYLLRACXXXXXXXXXXXX 240 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 i M 1 1 1 ! 1 1 1 1 

SBJCT: 181 PSAQLPSSHNPPPVSCQMPLLDSNTSHQIMDTNPDEEFSPNSYLLRACSGPQQASSSGPP 240 

QUERY: 241 NHHSQXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQIHAPAPAPNDLATTPESVQ 3 00 

20 {MM I I I I II I I I I I I I I I I I t I I 

SBJCT: 241 NHHSQSTLRPPLPPPHNHTLSHHHSSANSLKRNSLTNRRSQIHAPAPAPNDLATTPESVQ 3 00 

QUERY: 301 LQDSWLNSNVPLETRHFLFKXXXXXXXXXXXXXXXYPLTSGTVYTPPPRLLPRNTFSRK 360 

IIIIIIIMIMIMIIIMI MIIIIIIIMIMIIMIIIIM 

25 SBJCT: 301 LQDSWVLNSNVPLETRHFLFKTSSGSTPLFSSSSPGYPLTSGTVYTPPPRLLPRNTFSRK 360 
QUERY: 361 AFKLKKPSKYCSWKCXXXXXXXXXXXXXXXXXYFI 395 

IMMIIIIIIIMI Ml 

SBJCT: 361 AFKLKKPSKYCSWKCAALSAIAAALLLAILLAYFIAMHLLGLNWQLQPADGHTFNNGVRT 42 0 



QUERY: 396 VPWSLKNSSIDSGEAEVGRRVTQEVPPGVFWRSQIHISQPQFLK 439

MIMMMMIIIMIIIiMIIIIIIIMMIMIIIIIMI 

SBJCT: 421 GLPGNDDVATVPSGGKVPWSLKNSSIDSGEAEVGRRVTQEVPPGVFWRSQIHISQPQFLK 480 



35 QUERY * 440 FNISLGKDALFGVYIRRGLPPSHAQYDFMERLDGKEKWSWESPRERRSIQTLVQNEAVF 499 

IMMIIIIMIIIIMMlllMllllMIIIIIMIIIIMIillMMIIIMIIII 

SBJCT: 481 FNISLGKDALFGVYIRRGLPPSHAQYDFMERLDGKEKWS WES PRERRS IQTLVQNEAVF 540
QUERY: 500 VQYLDVGLWHLAFYNDGKDKEMVSFNTWLDSVQDCPRNCHGNGECVSGVCHCFPGFLGA 559 

40 IIMIIMIIIIIIIIIMiMIIIIMIMMMMIIIIlMMMhllMMMII 

SBJCT: 541 VQYLDVGLWHLAFYNDGKDKEMVSFNTWLDSVQDCPRNCHGNGECVSGLCHCFPGFLGA 600 
QUERY: 560 DCAKAACPVLCSGNGQYSKGTCQCYSGWKGAECDVPMNQCIDPSCGGHGSCIDGNCVCSA 619 

I { 1 1 1 1 1 1 M ! I ! I f 1 1 1 i 1 1 1 f i 1 1 1 M i 1 1 1 E i I i 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 E 1 1 1 1 ^ 1 

45 SBJCT: 601 DCAKAACPVLCSGNGQYSKGTCQCYSGWKGAECDVPMNQCIDPSCGGHGSCIDGNCVCAA 660 
QUERY: 620 GYKGEHCEEVDCLDPTCSSHGVCWGECLCSPGWGGLNCELARVQCPDQCSGHGTYLPDT 679 

IIIIMIIIIIIIIMMIIMIMIIMMIIIMIIIIMIMMIIIIMIMIIk 

SBJCT: 661 GYKGEHCEEVDCLDPTCSSHGVCVNGECLCSPGWGGLNCELARVQCPDQCSGHGTYLPDS 720 


QUERY: 680 GLCSCDPNWMGPDCSVEVCSVDCGTHGVCIGGACRCEEGWTGAACDQRVCHPRCIEHGTC 739 

IIIIIIMIIMllll IIIIIIIIIIIMMIIIIIIIMIIIMIMIMIMIMII 

SBJCT: 721 GLCSCDPNWMGPDCSV-VCSVDCGTHGVCIGGACRCEEGWTGAACDQRVCHPRCIEHGTC 779 

********* 

55 QUERY: 740 KDGKCECREGWNGEHCTIGRQTAGTETDGCPDLCNGNGRCTLGQNSWQCVCQTGWRGPGC 799 

MIMIIIIIIIMIIM IIIIMIIIIIMIIIIIIMMIIMIIIMI 

SBJCT: 780 KDGKCECREGWNGEHCTI DGCPDLCNGNGRCTLGQNSWQCVCQTGWRGPGC 830 

QUERY: 800 NVAMETSCADNKDNEGDGLVDCLDPDCCLQSACQNSLLCRGSRDPLDIIQQGQTDWPAVK 859 

60 | I I I I I I I I I I I I I I I II II I I I I I I I I I II I I I I II I I I I I I 1 I I I I I I I I I ! I I I I I I 

SBJCT: 831 NVAMETSCADNKDNEGDGLVDCLDPDCCLQSACQNSLLCRGSRDPLDIIQQGQTDWPAVK 890 

QUERY: 860 SFYDRIKLLAGKDSTHIIPGENPFNSSLVSLIRGQWTTDGTPLVGVNVSFVKYPKYGYT 919 

M i I f 1 1 1 1 1 ! 1 1 1 1 1 1 1 f I ^ M 1 1 1 1 1 1 i M 1 1 M f I IIIIMIIIIMIIIMIM1 

65 SBJCT: 891 S FYDRI KLLAGKDS THI I PGDNPFNS SLVSL I RGQ WTMDGTPLVGVNVS FVKYPKYGYT 950 
QUERY * 920 I TRQDGTFDL IANGGASLTLHFERAPFMSQERTVWLPWNS FYAMDTLVMKTEENS I PS CD 979 

llllimillllll++IIIIIIIIIIIM!l!ll!llllliim!l!!milllMI 

SBJCT : 951 I TRQDGTFDL IANGGSALTLHFERAPFMSQERTVWLPWNS FYAMDTLVMKTEENS I PS CD 1010 



QUERY: 980 LSGFVRPDP 1 1 1 S S PLSTFFSAAPGQNP I VPETQVLHEE I ELPGSNVKLRYLSSRTAGYK 1039

IMMMMSIMMIIMMM 1 1 II II 1 1 II 1 1 II I II l + M 1 1 1 1 1 1 1 II M II 

SBJCT: 1011 LSGFVRPDPIIISSPLSTFFSASPASNPIVPETQVLHEEIELPGTNVKLRYLSSRTAGYK 1070 
5 QUERY * 1040 SLLKITMTQSTVPLNL IRVHLMVAVEGHLFQKS FQASPNLASTF I WDKTDAYGQRVYGLS 1099 

MIMMIMMIMMMIIIMIIIMIIIIIIIIIMI II 1 1 II 1 1 1 1 1 1 1 1 M M 

SBJCT: 1071 SLLKITMTQSOTPLNLIRVHLMVAVEGHLFQKS FQAS PNLAYTF I WDKTDAYGQRVYGLS 1130 
QUERY: 1100 DAWSVGFEYETCPSLILWEKRTALLQGFELDPSNLGGWSLDKHHILNVKSGILHKGTGE 1159

10 ' IIIMIIIIIIMMIIIIIIIIIIMIIIIMIIIIIIMMII 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

SBJCT: 1131 DAWSVGFEYETCPSLILWEKRTALLQGFELDPSNLGGWSLDKHHTLNVKSGILHKGTGE 1190 
QUERY- 1160 NQFLTQQPAI ITS IMGNGRRRS ISCPSCNGLAEGNKLLAPVALAVGIDGSLYVGDFNYIR 1219 

MiMMIMMMMIIIMIIIIMIMMIIIIMIMIIIIIIMIMMIIMI 

15 SBJCT: 1191 NQFLTQQPAI ITS IMGNGRRRS ISCPSCNGLAEGNKLLAPVALAVGIDGSLFVGDFNYIR 1250 
QUERY - 1220 RIFPSRNVTSILELRNKEFKHSNNPAHKYYLAVDPVSGSLYVSDTNSRRIYRVKSLSGTK 1279 

MIMMIIIIIIIMIIIIMM IMIIIMII-HIMIMIIIIIIMMIIII I 

SBJCT: 1251 RIFPSRNVTSILELRNKEFKHSNSPGHKYYLAVDPVTGSLYVSDTNSRRIYRVKSLSGAK 1310












QUERY* 1280 DLAGNSEWAGTGEQCLPFDEARCGDGGKAIDATLMSPRGIAVDKNGLMYFVDATMIRKV 1339 

MIIIIIIIIIIIIIIIIIIMIIIIIIII + IIIIIIIIMIIIIIIMIIIIIIMIII 

SBJCT: 1311 DLAGNSEWAGTGEQCLPFDEARCGDGGKAVDATLMSPRGIAVDKNGLMYFVDATMIRKV 1370 



25 QUERY- 1340 DQNGIISTLLGSNDLTAVRPLSCDSSMDVAQVRLEWPTDLAVNPMDNSLYVLENNVILRI 1399 

IIIIIMIimillMIIIMMIIIIMIIIiMimillMIIIIIIIMIillll 

SBJCT: 1371 DQNGIISTLLGSNDLTAVRPLSCDSSMDVAQVRLEWPTDLAVNPMDNSLYVLENNVILRI 1430 

QUERY: 1400 TENHQVSIIAGRPMHCQVPGIDYSLSKXXXXXXXXXXXXXXXXXTGVLYITETDEKKINR 1459 

30 | | | | | | | | | | | | | | | | | | | | M II I I I 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 

SBJCT: 1431 TENHQVSIIAGRPMHCQVPGIDYSLSKLAIHSALESASAIAISHTGVLYITETDEKKINR 1490 

QUERY: 1460 LRQVTTNGE I CLLAGAASXXXXXXXXXXXXYSGDDAYATDAI LNS PSSLAVAPDGTI YI A 1519 

1 1 ! | ! I i I ' ; I I : I I IIIIIMIIIIMIMIIIMIIMIIIII 

35 SBJCT: 1491 LRQVTTNGEICLLAGAASDCDCKNDVNCICYSGDDAYATDAILNSPSSLAVAPDGTIYIA 1550 
QUERY* 1520 DLGNIRIRAVSKNKPVLNAFNQYEAASPGEQELYVFNADGIHQYTVSLVTGEYLYNFTYS 1579 

I E 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 f ! 1 ! 1 1 i 1 1 1 1 1 1 1 E 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 

SBJCT: 1551 DLGNIRIRAVSKNKPVLNAFNQYEAASPGEQELYVFNADGIHQYTVSLVTGEYLYNFTYS 1610 



QUERY: 1580 TDNDVTELIDNNGNSLKIRRDSSGMPRHLLMPDNQIITLTVGTNGGLKWSTQNLELGLM 163 9 

IIIIIIIIMIMIIIIMMIIIMIIIIIIIMIIIIMIIMM IIIIIIMIII 

SBJCT: 1611 ADNDVTELIDNNGNSLKIRRDSSGMPRHLLMPDNQIITLTVGTNGGLKAVSTQNLELGLM 1670 



QUERY: 1640 TYDGNTGLLATKSDETGWTTFYDYDHEGRLTNVTRPTGWTSLHREMEKSITIDIENSNR 1699

MlllllllllillllllllMIMIMIIIIIIIIIIMMIlllllllllMIIMII 

SBJCT: 1671 TYDGNTGLLATKSDETGWTTFYDYDHEGRLTNVTRPTGWTSLHREMEKSITIDIENSNR 1730 
QUERY* 1700 DDDVTVITNLSSVEASYTWQDQVRNSYQLCNNGTLRVMYANGMGISFHSEPHVLAGTIT 1759 

50 III III I III MM III I III III I III MM II II III III II HIM I II I III I II 

SBJCT: 1731 DDDVTVITNLS SVEAS YTVVQDQVRNS YQLCNNGTLRVMYANGMAVSFHSEPHVLAGT IT 1790 

QUERY: 1760 PTIGRCNISLPMENGLNSIEWRLRKEQIKGKVTIFGRKLRVHGRNLLSIDYDRNIRTEKI 1819 

I II I I I I I I I I II I I I I I I I I M II I I I I M I I II II I I II II II M I I II I I I M M II 
55 SBJCT: 1791 PTIGRCNISLPMENGLNSIEWRLRKEQIKGKVTIFGRKLRVHGRNLLSIDYDRNIRTEKI 1850 

QUERY: 1820 YDDHRKFTLRIIYDQVGRPFLWLPSSGLAAVNVSYFFNGRLAGLQRGAMSERTDIDKQGR 1879 

M 1 1 II M II 1 1 II II II M II II I M 1 1 1 II II 1 1 M 1 1 1 1 II II M 1 1 1 1 1 1 II M 1 1 

SBJCT: 1851 YDDHRKFTLRI I YDQVGRPFLWLPSSGLAAVNVSYFFNGRLAGLQRGAMSERTD IDKQGR 1910 



QUERY: 1880 IVSRMFADGKVWSYSYLDKSIVT^LLLQSQRQYIFEYDSSDRLLAVTMPSVARHSMSTHTSI 193 9 

MMIIIIMMIIIIIIIIIMMIMMIIIIIMMII 1 1 1 M I ! i 1 1 1 II II I M 

SBJCT: 1911 I VSRMFADGKVWS YS YLDKSMVLLLQSQRQYI FEYDSSDRLHAVTMPSVARHSMSTHTS I 1970 



65 QUERY: 1940 GYIRNIYNPPESNASVIFDYSDDGRILKTSFLGTGRQVFYKYGKLSKLSEIVYDSTAVTF 1999 

I II II I II I II II 11 I II I II I II II II I I II I I I I II II II I II I I I I II II I II I II I 
SBJCT: 1971 GYIRNIYNPPESNASVIFDYSDDGRILKTSFLGTGRQVFYKYGKLSKLSEIVYDSTAVTF 2030 

QUERY: 2000 GYDETTGVLKMVNLQSGGFSCTIRYRKIGPLVDKQIYRFSEEGMVNARFDYTYHDNSFRI 2059 

MMMMMMMMMIMIIMiMMIilMMMIMMIMMMIIMIM

SBJCT: 2031 GYDETTGVLKMVNLQSGGFSCTIRYRKVGPLVDKQIYRFSEEGMINARFDYTYHDNSFRI 2090
QUERY: 2060 ASIKPVISETPLPVDLYRYDEISGKVEHFGKFGVIYYDINQIITTAVMTLSKHFDTHGRI 2119 

MIIIIMMIMIIIIMIMllMlllllllilllllillllMMIIMMIIIIM 

5 SBJCT: 2091 AS I KPVI SETPLPVDLYRYDEISGKVEHFGKFGVI YYDINQ 1 1 TTAVMTLSKHFDTHGR I 2150 
QUERY * 212 0 KEVQYEMFRSLMYWMTVQYDSMGRVIKRELKLGPYANTTKYTYDYDGDGQLQSVAVNDRP 2179 

MMMIIIMillllMIIIIMMiMIIIIIMllllMiMMIMMMMlIM 

SBJCT * 2151 KEVQYEMFRSLMYWMTVQYDSMGRVIKRELKLGPYANTTKYTYDYDGDGQLQSVAVNDRP 2210 


QUERY- 2180 TWRYSYDXXXXXXXXXXXXSVRLMPLRYDLRDRITRLGDVQYKIDDDGYLCQRGSDIFEY 2239 

MIIMI I MIIIIMIMMIMIIMMMIIIIIMIMIIMI 

SBJCT: 2211 TWRYSYDLNGNLHLLNPGNSARLMPLRYDLRDRITRLGDVQYKIDDDGYLCQRGSDIFEY 2270 
15 QUERY: 224 0 NSKGLLTRAYNKASGWSVQYRYDGVGRRASYKTNLGHHLQYFYSDLHNPTRITHVYNHSN 2299 

IIMIIIIIilMMlMMIlllMMIMIIIIMIIMMMllllMIIMMMI 

SBJCT: 2271 NSKGLLTRAYNKASGWSVQYRYDGVGRRASYKTNLGHHLQYFYSDLHNPTRITHVYNHSN 2330 
QUERY- 2300 SEITSLYYDLQGHLFAMESSSGEEYYVASDNTGTPLAVFSINGLMIKQLQYTAYGEIYYD 2359 

20 1 i 1 1 1 [ 1 1 1 1 1 1 ! M 1 1 1 1 M 1 1 1 1 1 1 1 1 1 M 1 1 f 1 1 1 1 1 1 M 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 

SBJCT: 2331 SEITSLYYDLQGHLFAMESSSGEEYYVASDNTGTPLAVYSINGLMI KQLQYTAYGE IYYD 2390 
QUERY * 2360 SNPDFQIWIGFHGGLYDPLTKLVHFTQRDYDVIAGRWTSPDYTMWKNVGKEPAPFNLYMF 2419 

IIIIMMMMllllilllMIMIMMIillllMillllllHMMiMMMM 

25 SBJCT: 2391 SNPDFQMVIGFHGGLYDPLTKLVHFTQRDYDVLAGRWTSPDYTMWRNVGKEPAPFNLYMF 2450 
QUERY* 2420 KSNNPLSSELDLKNYVTDVKSWLVMFGFQLSNI I PGFPRAKMYFVPPPYELSESQASENG 2479 

k!MII + IIIIIMIIilllilllillllliililliiiiIIIMMIItMMIIIII 

SBJCT* 2451 KNNNPLSNELDLKNYVTDVKSWLVMFGFQLSNIIPGFPRAKMYFVPPPYELSESQASENG 2510 


QUERY: 2480 QLITGVQQTTERHNQAFMALEGQVITKKLHASIREKAGHWFATTTPIIGKGIMFAIKEGR 253 9 

IMlMIIMIIIIIIhllllllllllllMIIMMIIMlllllMMIIIMIMI 

SBJCT: 2511 QLITGVQQTTERHNQAFLALEGQVITKKLHAS IREKAGHWFATTTPI IGKGIMFAIKEGR 2570 
35 QUERY • 2540 VTTGVSSIASEDSRKVASVLNNAYYLDKMHYSIEGKDTHYFVKIGSADGDLVTLGTTIGR 2599 

MUMMMIMIIiliiilMlllliMlliliiiillllllMIMIIIIMIMI 

SBJCT: 2571 VTTGVSS IASEDSRKVASVLNNAYYLDKMHYS IEGKDTHYFVKIGAADGDLVTLGTTIGR 2630 
QUERY * 2600 KVLESGVNVTVSQPTLLVNGRTRRFTNIEFQYSTLLLSIRYGLTPDTLDEEKARVLDQAR 2659 

40 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 M 1 1 1 1 1 1 1 1! 1 1 1 1 ! 1 1 1 ! I ! 1 1 1 1 1 1 1 1 1 M 1 1 1 M I! ! M 

SBJCT: 2631 KVLESGVNVTVSQPTLLVNGRTRRFTNIEFQYSTLLLSIRYGLTPDTLDEEKARVLDQAG 2690 
QUERY: 2660 QRALGTAWAKEQQKARDGREGSRLWTEGEKQQLLSTGRVQGYEGYYVLPVEQYPELADSS 2719 

MilMMIIIlMIMIIIMIMMMIililMIIIIIIIIIIIIIIINNNNi 

45 SBJCT: 2691 QRALGTAWAKEQQKARDGREGSRLWTEGEKQQLLSTGRVQGYEGYYVLPVEQYPELADSS 2750 



QUERY: 2720 SNIQFLRQNEMGKR 2733 

IIIIIIIIMII1I 

SBJCT : 2751 SNIQFLRQNEMGKR 2764 


* = FCTR3F DOES NOT CONTAIN THESE AMINO ACIDS 



FCTR3 is related to rat neurestin, a gene implicated in neuronal development (Otaki JM,
Firestein S. Dev Biol 1999 Aug 1;212(1):165-81). Neurestin shows homology to human
gamma-heregulin, a Drosophila receptor-type pair-rule gene product, Odd Oz (Odz) / Ten(m), and
Ten(a). Neurestin has putative roles in synapse formation and brain morphogenesis. A mouse
neurestin homolog, DOC4, has independently been isolated from NIH-3T3 fibroblasts.
DOC4 is also known as tenascin M (TNM), a Drosophila pair-rule gene homolog containing
extracellular EGF-like repeats. The significant homology to these molecules, and in particular to
gamma-heregulin, has important implications regarding the potential contribution of FCTR3 to
disease progression. Heregulin is the ligand for HER-2/ErbB2/NEU, a proto-oncogene receptor
tyrosine kinase implicated in breast and prostate cancer progression that was originally identified
in rat neuro/glioblastoma cell lines. Ectopic expression of HER-2/ErbB2/NEU in MDA-MB-435
breast adenocarcinoma cells confers chemoresistance to Taxol-induced apoptosis relative to
vector-transfected control cells (Yu et al. Overexpression of ErbB2 blocks Taxol-induced
apoptosis by up-regulation of p21Cip1, which inhibits p34Cdc2 kinase. Molec. Cell 2: 581-591,
1998).

FCTR3 related tenascins and cancer biology 

As mentioned, FCTR3 also has significant homology to DOC4 (also known as tenascin M), a
Drosophila pair-rule gene homolog containing extracellular EGF-like repeats. The tenascins are
a growing family of extracellular matrix proteins that play prominent roles in tissue interactions
critical to embryogenesis. Overexpression of tenascins has been described in multiple human
solid malignancies.

The tenascin family of related proteins regulates epithelial-stromal interactions and
participates in fibronectin-dependent cell attachment and interaction. Indeed,
tenascin-C (TN) is overexpressed in the stroma of malignant ovarian tumours, particularly at the
interface between epithelia and stroma, leading to suggestions that it may be involved in the
process of invasion (Wilson et al. (1996) Br J Cancer 74: 999-1004). Tenascin-C is considered a
therapeutic target for certain malignant brain tumors (Gladson CL. J Neuropathol Exp Neurol
1999 Oct;58(10):1029-40). Stromal or moderate to strong periductal tenascin-C expression in
DCIS (ductal carcinoma in situ) correlates with tumor cell invasion (Jahkola et al. Eur J Cancer
1998 Oct;34(11):1687-92). Tenascin-C expression at the invasion border of early breast cancer is
a useful predictor of local and distant recurrence (Jahkola T, et al. Br J Cancer 1998
Dec;78(11):1507-13). Tenascin (TN) is an extracellular matrix protein found in areas of cell
migration during development and expressed at high levels in migratory glioma cells
(Treasurywala S, Berens ME. Migration arrest in glioma cells is dependent on the alphaV
integrin subunit. Glia 1998 Oct;24(2):236-43; Phillips GR, Krushel LA, Crossin KL. Domains of
tenascin involved in glioma migration. J Cell Sci 1998 Apr;111 (Pt 8):1095-104). Finally, tenascin
expression in hormone-dependent tissues of breast and endometrium indicates that tenascin
expression reflects malignant progression and is down-regulated by antiprogestins during
terminal differentiation of rat mammary tumors (Vollmer et al. Cancer Res 1992 Sep
1;52(17):4642-8).

Potential role of FCTR3 in oncologic disease progression:

Based on the bioactivity described in the medical literature for related molecules, FCTR3
may play a role in one or more aspects of tumor cell biology that alter the interactions of tumor
epithelial cells with stromal components. Accordingly, FCTR3 may play a role in the
following malignant properties:

Autocrine/paracrine stimulation of tumor cell proliferation.

Autocrine/paracrine stimulation of tumor cell survival and tumor cell resistance to
cytotoxic therapy.

Local tissue remodeling, parenchymal and basement membrane invasion and motility of
tumor cells, thereby contributing to metastasis.

Tumor-mediated immunosuppression of T-cell mediated immune effector cells and
pathways, resulting in tumor escape from immune surveillance.

Therapeutic intervention targeting FCTR3 in oncologic and central nervous system indications:

Predicted disease indications from expression profiling in 41 normal human tissues and
55 human cancer cell lines (see Example 2) include a subset of human gliomas, astrocytomas,
mixed glioma/astrocytomas, renal cell carcinoma, breast adenocarcinoma, ovarian cancer, and
melanomas. Targeting of FCTR3 by human or humanized monoclonal antibodies designed to
disrupt predicted interactions of FCTR3 with its cognate ligand may result in significant
anti-tumor/anti-metastatic activity and the amelioration of associated symptomatology.
Identification of small molecules that specifically/selectively interfere with downstream
signaling components engaged by FCTR3/ligand interactions would also be expected to result in
significant anti-tumor/anti-metastatic activity and the amelioration of associated
symptomatology. Likewise, modified antisense ribonucleotides or antisense gene expression
constructs (plasmids, adenovirus, adeno-associated viruses, "naked" DNA approaches) designed
to diminish the expression of FCTR3 transcripts/messenger RNA (mRNA) would be anticipated,
based on predicted properties of FCTR3, to have anti-tumor impact.

Based on the relatedness to neurestin and heregulins, as well as its high level expression
in brain tissue, FCTR3 may also be used to promote regeneration, repair and remyelination of
central nervous system cells injured by ischemia, brain trauma and various neurodegenerative
diseases. This postulate is based on reports indicating that a neuregulin, glial growth factor 2,
diminishes autoimmune demyelination and enhances remyelination in a chronic relapsing model
for multiple sclerosis (Cannella et al. Proc. Nat. Acad. Sci. 95: 10100-10105, 1998). The
expression of the related molecule neurestin can be induced in external tufted cells during
regeneration of olfactory sensory neurons.



FCTR4 

FCTR4 is a plasma membrane protein related to NF-Kappa-B P65delta3 protein. The
clone is expressed in fetal liver tissues.

The novel FCTR4 nucleic acid of 609 nucleotides (also referred to as 29692275.0.1) is
shown in Table 4A. An ORF begins with an ATG initiation codon at nucleotides 99-101 and
ends with a TAA codon at nucleotides 522-524. Putative untranslated regions upstream from
the initiation codon and downstream from the termination codon are underlined in Table 4A, and
the start and stop codons are in bold letters.

Table 4A. FCTR4 Nucleotide Sequence (SEQ ID NO: 14) 

CTGACATACTATATTAGTTGTTTGTTCACTGTCTCCACTCC^GCTAGAATATAAGTTCCATAGGGCAGAGTTTTTGTTCA 
CTGCTATATTTTATAAGC ATGAATGAATGCATC^ 

ACTGCCCTCCTTTCATTTTATTCTCACCTCTACCAATACTAAATCACCTAGTTATGTAAATACGATATGCACTTTCATGG 

15 CCCCTTGCTTTGTCATATGCTGTTCCCTT^ 

AGAAAGCTTCCTCTGTCCACCCCCACCCTCCCACCCCCATATAGAGTAAGTCAGTCTTTCCTTTGTGCTACATTTGTACC 
TGTATCTACAGTGGCTCTAATCAAACTGCACTGTGTCTCTCACTTCCTAGATTGTGAACTCTTTGAGGCTGAAGACTACT 
TATTraT PTPTTTA C PT f! P A ATGC CT AGGACAGGACC TT CAT A AAGCAACTACT C TATAAATGTTGAAACATATG CATGA 
CTATTCTGTAACAGGAATGAAAATATGGCATTTCAAGAAGTCACTACTC 


The FCTR4 protein encoded by SEQ ID NO:14 has 141 amino acid residues and is
presented using the one-letter code in Table 4B. The Psort profile for FCTR4 predicts that this
sequence has no N-terminal signal peptide and is likely to be localized at the plasma membrane
with a certainty of 0.6000. The most likely cleavage site for a peptide is between amino acids 39
and 40, i.e., at the dash in the amino acid sequence ACT-CCA, based on the SignalP result. The
predicted molecular weight of this protein is 16051.5 Daltons.
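
The stated protein length can be cross-checked against the ORF coordinates given above. The sketch below is illustrative only: the function name is hypothetical, and it assumes 1-based, inclusive nucleotide positions and a stop codon that encodes no residue.

```python
def orf_protein_length(start_first, stop_last):
    """Number of amino acids encoded by an ORF.

    start_first: 1-based position of the first base of the start codon.
    stop_last:   1-based position of the last base of the stop codon.
    """
    span = stop_last - start_first + 1  # bases from ATG through the stop codon
    if span <= 0 or span % 3 != 0:
        raise ValueError("ORF span is not a whole, positive number of codons")
    codons = span // 3
    return codons - 1  # the stop codon contributes no residue


# FCTR4: ATG at nucleotides 99-101, TAA at nucleotides 522-524
print(orf_protein_length(99, 524))  # 141
```

The result agrees with the 141 residues stated for FCTR4. Applied to the FCTR5 coordinates reported in this document (ATG at 34-36, TGA at 1495-1497), the same arithmetic gives 487.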

Table 4B. Encoded FCTR4 protein sequence (SEQ ID NO: 15). 

MNECiyiNEWTDNPQAKDLHDLPLPSFHFILTSTNTKSPSYVNTICTFMAPCFVICCSLCLEYKLSKYHPHFKIFSRKLPLSTPTLPP 
PYRVSQSFLCATFVPVSTVALIKLHCVSHFLDCELFEAEDYLFISLPPMPRTGPS 


The predicted amino acid sequence was searched in the publicly available GenBank
database. The FCTR4 protein showed 30% identities (22 over 72 amino acids) and 43% homologies
(31 over 72 amino acids) with hypothetical 10 kD protein of Trypanosoma cruzi (86 aa;
ACC:Q99233) shown in Table 4C. The best homologies with a human protein were 54%
identities (114 over 343 amino acids) with NF-Kappa-B P65delta3 protein (71 aa fragment;
ACC:Q13313) (SEQ ID NO:77).

Table 4C. BLASTP of FCTR4 against protein sequences

BLASTX search results are shown below:

ptnr:SPTREMBL-ACC:Q99233 HYPOTHETICAL 10 KD PROTEIN +3, 68, 0.60, 1 (SEQ ID NO:73)

ptnr:SPTREMBL-ACC:Q16896 GABA RECEPTOR SUBUNIT - AEDES +3, 66, 0.81, 4 (SEQ ID NO:74)

ptnr:SPTREMBL-ACC:076473 GABA RECEPTOR SUBUNIT - LEPTI... +3, 66, 0.99, 2 (SEQ ID NO:75)

ptnr:TREMBLNEW-ACC:AAD28317 F13J1L13 PROTEIN - Arabid... +3, 62, 0.99, 1 (SEQ ID NO:76)

Based upon homology, FCTR4 proteins and each homologous protein or peptide may
share at least some activity.



FCTR5 

FCTR5 is a protein bearing sequence homology to human complement C1R component
precursor. The clone is expressed in breast, heart, lung, fetal lung, salivary gland, adrenal gland,
spleen, kidney, and fetal kidney.

The novel FCTR5 nucleic acid of 1667 nucleotides (also referred to as 32125243.0.21) is
shown in Table 5A. An ORF begins with an ATG initiation codon at nucleotides 34-36 and ends
with a TGA codon at nucleotides 1495-1497. Putative untranslated regions upstream from the
initiation codon and downstream from the termination codon are underlined in Table 5A, and the
start and stop codons are in bold letters.

Table 5A. FCTR5a Nucleotide Sequence (SEQ ID NO:16)

GTTCTCTCGCAGGTCCCAGATGTCCAGTTCCAGA TGCCTGGACCCAGAGTGTGGGGGAAATATCTCTGGAGAAGCCCTCA 
CTCCAAAGGCTGTCCAGGCGCAATGTGGTGGCTGCTTCTCTGGGGAGTCCTCCAGGCTTGCCCAACCCGGGGCTCCGTCC 

TCTTGGCCCAAGAGCTACCCCAGCAGCTGACATCCCCCGG 

25 GACATCAAGGCTCCAGAGGGCTTTGCTGTGAGGCTCGTCTTCCAGGACTTCGACCTGGAGCCGTCCCAGGACTGTGCAGG 
GGACTCTGTCACAATCTCATTCGTCGGTTCGGATCCAAGCCAGTTCTGTGGTCAGCAAGGCTCCCCTCTGGGCAGGCCCC 
CTGGTCAGAGGGAGTTTGTATCCTCAGGGAGGAGTTTGCGGCTGACCTTCCGCACACAGCCTTCCTCGGAGAACAAGACT 
GCCCACCTCCACAAGGGCTTCCTGGCCCTCTACCAAACCGTGGCTGTGAACTATAGTCAGCCCATCAGCGAGGCCAGCAG 
GGGCTCTGAGGCCATCAACGCACCTGGAGACAACCCTGCCAAGGTCCAGAACCACTGCCAGGAGCCCTATTATCAGGCCG 

30 CGGCAGCAGGGGCACTCACCTGTGCAACCCCAGGGACCTGGAAAGACAGACAGGATGGGGAGGAGGTTCTTCAGTGTATG 
CCTGTCTGCGGACGGCCAGTCACCCCCATTGCCCAGAATCAGACGACCCTCGGTTCTTCCAGAGCCAAGCTGGGCAACTT 
CCCCTGGCAAGCCTTCACCAGTATCCACGGCCGTGGGGGCGGGGCCCTGCTGGGGGACAGATGGATCCTCACTGCTGCCC 
ACACCATCTACCCCAAGGACAGTGTTTCTCTCAGGAAGAACCAGAGTGTC 

GAGATGCTGAAACTGGGGAACCACCCTGTCCACCGTGTCGTTGTGCACCCCGACTACCGTCAGAATGAGTCCCATAACTT 
35 TAGCGGGGACATCGCCCTCCTGGAGCTGCAGCACAGCATCCCCCTGGGCCCCAACGTCCTCCCGGTCTGTCTGCCCGATA 
ATGAGACCCTCTACCGCAGCGGCTTGTTGGGCTACGTCAGTGGGTTTGGCATGGAGATGGGCTGGCTAACTACTGAGCTG 
AAGTACT CGAGG CTG C CTGTAG CT C C CAGGGAGG C CTGCAACGC CTGG CT C CAAAAGAGACAGAGAC C CGAGGTGTTT T C 
TGACAATATGTTCTGTGTTGGGGATGAGACGCAAAGGCACAGTGTCTGCCAGGGGGACAGTGGCAGCCTCTATGTGGTAT 
GGGAGAATCATGCGCATCACTGGGTGGCCACGGGCATTGTGTCCTGGGGCATAGGGTGTGGCGAAGGGTATGACTTCTAC 

40 arraarinTttrTranrTaTft^ 

GACCAGCACAGTGGAGGCCCCAGGCAACAGAGGGCCTGGAGTGAGGACTGAACACTGGGGTAGGGGGTGGGGGTTTCTCT 
TGCAGTGGCTTGGTGCAACAGTGATGTGAATAGGATTTCCCTTTTTTTTTTTTTTTTTAAAAAAAAA

The FCTR5 protein encoded by SEQ ID NO:16 has 487 amino acid residues and is
presented using the one-letter code in Table 5B. FCTR5 was searched against other databases
using SignalPep and PSort search protocols. The FCTR5 protein is most likely localized to the
microbody (peroxisome) (Certainty=0.6406) and seems to have no N-terminal signal sequence.
The predicted molecular weight of FCTR5 protein is 53511.9 daltons.
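
The source does not say how the predicted molecular weight was computed. As a hedged illustration only, a common approach sums average residue masses and adds one water for the free termini. The mass table below uses standard textbook averages, which are an assumption here, and the function name is hypothetical. The example input is a two-residue peptide rather than the FCTR5 sequence.

```python
# Average residue masses in daltons (amino acid minus one water).
# Standard textbook values, assumed for illustration; not taken from the source.
AVERAGE_RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER_MASS = 18.01528  # one water accounts for the free N- and C-termini


def average_molecular_weight(protein):
    """Approximate average molecular weight of an unmodified polypeptide."""
    total = WATER_MASS
    for residue in protein.upper():
        if residue not in AVERAGE_RESIDUE_MASS:
            raise ValueError(f"unknown residue: {residue!r}")
        total += AVERAGE_RESIDUE_MASS[residue]
    return total


# Dipeptide Gly-Ala as a small worked case
print(round(average_molecular_weight("GA"), 2))
```

As a rough plausibility check, 53511.9 daltons divided by 487 residues is about 110 daltons per residue, which is in the range typical of proteins.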



Table 5B. Encoded FCTR5a protein sequence (SEQ ID NO:17).

MPGPRWGKYLWRSPHSKGCPGAMWWLLLWGVLQACPTRGSVLLAQELPQQLTSPGYPEPYGKGQESSTDIKAPEGFAVRLVFQDF 
DLEPSQDCAGDSVTISFVGSDPSQFCGQQGSPLGRPPGQREFVSSGRSLRLTFRTQPSSENKTAHLHKGFLALYQTVAVNYSQPIS 
10 EASRGSEAINAPGDNPAKVQNHCQEPYYQAAAAGALTC^TPGTWKDRQDGEEVLQCMPVCGRPVTPIAQNQTTLGSSRAKLGNFPW 
QAFTSIHGRGGGALLGDRWILTAAHTIYPKDSVSLRKNQSVOTFLGHTAIDEMLKLGNHPVHRVVVH 

LQHSIPLGPNVLPVCLPDNETLYRSGLLGYVSGFGMEMGWLTTELKYSRLPVAPREACNAWLQKRQRPEVFSDNMFCVGDETQRHS 
VCQGDSGSLYVWDNHAHHWVATGIVSWGIGCGEGYDFYTKVLSYVDWIKGVMNGKN 


An alternative embodiment, FCTR5b, is a 1691 base sequence shown in Table 5C.

Table 5C. FCTR5b Nucleotide Sequence (SEQ ID NO:18)

TTTTTTTTTAAAAAAAAAAAAAAAAAGGGAAATCCTATTC^^ 

ACCCCAGTGTTCAGTCCTCACTCCAGGCCCTCTGTTGCCTGGGGCCTCCACTGTGCTGGTCAGTCCCTGTTCAAGCCCCCAGGGTC 

20 AATTCTTGCCATTCATCACTCCCTTGATCCAGTC 

CCCCAGGACACAATGCCCGTGGCCACCCAGTGATGGGCATGATTC 

ACTGTGCCTTTGCGTCTCATCCCCAACACAGAACATATTGTCAGAAAACACCTCGGGTCTCTGTCTCTTTTGGAGCCAGGCGTTGC 
AGGCCTCCCTGGGAGCTACAGGCAGCCTCGAGTACTTCAGCTCAGT^^ 

CCCAACAAGCCGCTGCGGTAGAGGGTCTCATTATCGGGCAGACAGACCGGGAGGACGTTGGGGCCCAGGGGGATGCTGTGCTGCAG 
25 CTCCAGGAGGGCGATGTCCCCGCTAAAGTTATGGGACTCATTCTGACGGTAGTCGGGGTGCACAACGACACGGTGGACAGGGTGGT 
TCCCC^GTTTCAGCATCTCATCTATGGCTGTGTC 

GGGTAGATGGTGTGGGCAGCAGTGAGGATCCATCTGTCCCCCAGCAGGGCCCCGCCCCCACGGCCGTGGATACTGGTGAAGGCTTG 
CCAGGGGAAGTTGCCCAGCTTGGCTCTGGAAGAACCGAGGGTCGTCTGATTCTGGGCAATGGGGGTGACTGGCCGTCCGCAGACAG 
GCATACACTGAAGAACCTCCTCCCCATCCTGTCTGTCTTTCCAGGTCCCTGGGGTTGCACAGGTGAGTGCCCCTGCTGCCGCGGCC 

30 TGATAATAGGGCTCCTGGCAGTGGTTCTGGACCTTGGCAGGGTTGTCTCCAGGTGCGTTGATGGCCTCAGAGCCCCTGCTGGCCTC 
GCTGATGGGCTGACTATAGTTCACAGCCACGGTTTGGTAGAGGGCCAGGAAGCCCTTGTGGAGGTGGGCAGTCTTGTTCTCCGAGG 
AAGGCTGTGTGCGGAAGGTCAGCCGCAAACTCCTCCCTGAGGATACAAACTCCCTCTGAC(^GGGGGCCTGCCC^GAGGGGAGCCT 
TGCTGACCACAGAACTGGCTTGGATCCGAACCGACGAATGAGATTGTGACAGAGTCCCCTGCACAGTCCTGGGACGGCTCCAGGTC 
GAAGTCCTGGAAGACGAGCCTCACAGCAAAGCCCTCTGGAGCCTTGATGTCCGTGCTGCTCTCTTGGCCTTTGCCATACGGCTCTG 

35 GGTACCCGGGGGATGTCAGCTGCTGGGGTAGCTGTTGGGCCAAGAGGACGGAGCCCCGGGTTGGGCAAGCCTGGAGGACTCCCCAG 
AGAAGC^GCC^CCACATTGCGCCTGGACAGCCTTTGGAGTGAGGGCTTCTCCAGAGATATTTCCCCCACACTCTGGGTCCAGGCAT 
CTGGAACTGGACATCTGGGACCTGCGAGAGAACTGGCCCAGGATAGGGAACAAAAGG 
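
An observation from the printed tables, not a statement made by the source: the opening bases of Table 5C read as the reverse complement of the closing bases of Table 5A. This suggests the two sequences are printed on opposite strands. The sketch below shows how such a relationship can be checked; the function name is hypothetical and the fragments are short excerpts copied from the two tables.

```python
# Translation table mapping each base to its Watson-Crick partner.
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return seq.translate(_COMPLEMENT)[::-1]


# Excerpt near the 3' end of Table 5A (just before the poly-T/poly-A tail)
tail_5a = "GAATAGGATTTCCC"
# Excerpt near the 5' end of Table 5C (just after the poly-T/poly-A run)
head_5c = "GGGAAATCCTATTC"

print(reverse_complement(tail_5a) == head_5c)  # True
```

Only the excerpts shown are compared; the full tables contain OCR damage, so a whole-sequence comparison would need the original sequence listing.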



The FCTR5b protein encoded by SEQ ID NO:18 has 487 amino acid residues and is
presented using the one-letter code in Table 5D. FCTR5 was searched against other databases
using SignalPep and PSort search protocols. The FCTR5b protein is most likely localized to the
microbody (peroxisome) (Certainty=0.6406) and seems to have no N-terminal signal sequence.
The predicted molecular weight of FCTR5 protein is 53511.9 daltons.

Table 5D. Encoded FCTR5b protein sequence (SEQ ID NO:19).

45 MPGPRWGKYLWRSPHSKGCPGAMWWLLLWGVLQACPTRGSVLLAQQLPQQLTSPGYPEPYGKGQESSTDIKAPEGFATOLVFQDF 
DLEPSQDCAGDSVTISFVGSDPSQFCGQQGSPLGRPPGQREFVSSGRSLRLTFRTQPSSENKTAHLHKGFLALYQTVAVNYSQPIS 
EASRGSEAINAPGDNPAKVQNHCQE P YYQAAAAGALTCATPGTWKDRQDGEEVLQCMPVCGRPVTP IAQNQTTLGSS RAKLGNFPW 
QAFTS IHGRGGGALLGDRW I LTAAHT I YPKDS VS LRKNQS VNVFLGHTAI DEMLKLGNHP VHRWVHPDYRQNESHNFSGDIALLE 
LQHSIPLGPNVLPVCLPDNETLYRSGLLGYVSGFGMEMGWLTTELKYSRLPVAPREACNAWLQKRQRPEVFSDNMFCVGDETQRHS 

VCQGDSGSLYVVWDNHAHHWATGIVSWGIGCGEGYDFYTKVLSYVDWIKGVMNGKN

The predicted amino acid sequence was searched in the publicly available GenBank
database. The FCTR5a protein showed 58% identities (177 over 302 amino acids) and 74%
homologies (226 over 302 amino acids) with human complement C1R component precursor
(EC 3.4.21.41) (705 aa; ACC:P00736). Based upon homology, FCTR5 proteins and each
homologous protein or peptide may share at least some activity.

In a search of sequence databases, it was found, for example, that nucleotides 17-1594
of the FCTR5a nucleic acid sequence have 1575 of 1578 bases (99%) identical to Homo
sapiens complement C1r-like proteinase precursor (GENBANK-ID: XM_007061.1) (SEQ ID
NO:78) (Table 5E).
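
The whole-number percentages quoted with these match counts can be reproduced with simple integer arithmetic. The sketch below assumes the percentage is truncated rather than rounded, an assumption that is consistent with 1575 of 1578 being reported as 99%. The function name is hypothetical.

```python
def percent_identity(matches, aligned_length):
    """Whole-number percent identity, truncated toward zero."""
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    if not 0 <= matches <= aligned_length:
        raise ValueError("matches must lie between 0 and aligned_length")
    return (100 * matches) // aligned_length


# Nucleotide identities reported for FCTR5a in this section
for matches, length in [(1575, 1578), (102, 117), (82, 94), (54, 63)]:
    print(f"{matches}/{length} -> {percent_identity(matches, length)}%")
```

Under this assumption the four pairs give 99%, 87%, 87% and 85%, matching the values stated in the text.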

Table 5E. BLASTN of FCTR5a against Homo sapiens complement C1r-like proteinase
precursor (SEQ ID NO:78)

>GI|11436767|REF|XM_007061.1| HOMO SAPIENS COMPLEMENT C1R-LIKE PROTEINASE PRECURSOR
(LOC51279), MRNA
LENGTH = 3318

SCORE = 3104 BITS (1566), EXPECT = 0.0
IDENTITIES = 1575/1578 (99%)
STRAND = PLUS / PLUS






QUERY: 17 CAGATGTCCAGTTCCAGATGCCTGGACCCAGAGTGTGGGGGAAATATCTCTGGAGAAGCC 76 

lllllMlllllMIIIIIIIIIMIMIlMMIIIIIIIIIlllilMMIIIIMM 

SBJCT: 1 CAGATGTCCAGTTCCAGATGCCTGGACCCAGAGTGTGGGGGAAATATCTCTGGAGAAGCC 60 



25 QUERY - 77 CTCACTCCAAAGGCTGTCCAGGCGCAATGTGGTGGCTGCTTCTCTGGGGAGTCCTCCAGG 13 6 

M M I M I II 1 1 1 1 1 1 i 1 1 1 F I i 1 1 1 ! 1 i I M M 1 1 1 1 1 1 1 M I i ! 1 1 1 1 M E 1 1 1 1 ) I } 

SBJCT: 61 CTCACTCCAAAGGCTGTCCAGGCGCAATGTGGTGGCTGCTTCTCTGGGGAGTCCTCCAGG 120 
QUERY- 13 7 CTTGCCCAACCCGGGGCTCCGTCCTCTTGGCCCAAGAGCTACCCCAGCAGCTGACATCCC 196 

30 M 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M II 1 1 1 1 1 ! 1 1 1 M 1 1 1 1 1 M 1 1 1 1 1 

SBJCT: 121 CTTGCCCAACCCGGGGCTCCGTCCTCTTGGCCCAAGAGCTACCCCAGCAGCTGACATCCC 18 0 
QUERY: 197 CCGGGTAC C CAGAGC CG TATGG CAAAGG C CAAGAGAGCAG CACGGACAT CAAGG CT CCAG 256 

MIMIIIIIMMMIIIIIMIIIIIIIMIMIMIMIIIIMMMMMIMII 

35 SBJCT: 181 CCGGGTACCCAGAGCCGTATGGCAAAGGCCAAGAGAGCAGCACGGACATCAAGGCTCCAG 240 
QUERY * 257 AGGGCTTTGCTGTGAGGCTCGTCTTCCAGGACTTCGACCTGGAGCCGTCCCAGGACTGTG 316 

MIIIIIIIMIillMlllllMMIIIIIIMIIMIIIMMIMMMMIIIIII 

SBJCT: 241 AGGGCTTTGCTGTGAGGCTCGTCTTCCAGGACTTCGACCTGGAGCCGTCCCAGGACTGTG 300 



QUERY: 317 CAGGGGACTCTGTCACAATCTCATTCGTCGGTTCGGATCCAAGCCAGTTCTGTGGTCAGC 376 

IIIIIIMlMIIMIIIIIIIIillllllllllllllMIMMMIIIIIIMIIIII 

SBJCT: 301 CAGGGGACTCTGTCACAATCTCATTCGTCGGTTCGGATCCAAGCCAGTTCTGTGGTCAGC 360 



45 QUERY: 3 77 AAGGCTCCCCTCTGGGCAGGCCCCCTGGTCAGAGGGAGTTTGTATCCTCAGGGAGGAGTT 43 6 

lllllll!IIIIIIIIM!IIIIIM!!iMIIIIMlllllllllMIMIIMMIM 

SBJCT: 3 61 AAGGCTCCCCTCTGGGCAGGCCCCCTGGTCAGAGGGAGTTTGTATCCTCAGGGAGGAGTT 420 
QUERY: 437 TGCGGCTGACCTTCCGCACACAGCCTTCCTCGGAGAACAAGACTGCCCACCTCCACAAGG 496 

50 1 1 1 1 1 1 1 1 M 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 M 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 M I II 

SBJCT: 421 TGCGGCTGACCTTCCGCACACAGCCTTCCTCGGAGAACAAGACTGCCCACCTCCACAAGG 48 0 
QUERY: 497 GCTTCCTGGCCCTCTACCAAACCGTGGCTGTGAACTATAGTCAGCCCATCAGCGAGGCCA 556 

MIMIIIlllllMIIIIIIMIIMIMIIIIIIIItMIIIIIIIIIMIIIMMI 

55 SBJCT: 4 81 GCTTCCTGGCCCTCTACCAAACCGTGGCTGTGAACTATAGTCAGCCCATCAGCGAGGCCA 540 
QUERY: 557 GCAGGGGCTCTGAGGCCATCAACGCACCTGGAGACAACCCTGCCAAGGTCCAGAACCACT 616 

IIIMIIIIIIMMIMIIIIIIMIIIIIIMMMMIMMMIMIIMMMII

SBJCT: 541 GCAGGGGCTCTGAGGCCATCAACGCACCTGGAGACAACCCTGCCAAGGTCCAGAACCACT 600
^GGAGCCCTATTATCAGGCCGCGGCAGCAGGGGCACTCACCTGTGCAACCCCAGGGA 676 

MllllllllllllllllllllllliMMIIMIIIMIIIIIIIIIIMIIIIII 

AGGAGCCCTATTATCAGGCCGCGGCAGCAGGGGCACTCACCTGTGCAACCCCAGGGA 660 
GGAAAGACAGACAGGATGGGGAGGAGGTTCTTCAGTGTATGCCTGTCTGCGGACGGC 736 

I ! 1 1 1 j 1 1 1 M I i ! I II i t M 1 1 1 M ! t i 1 M I M M M M 1 1 i i ( I M ( M M { M 

GGAAAGACAGACAGGATGGGGAGGAGGTTCTTCAGTGTATGCCTGTCTGCGGACGGC 72 0 
TCACCCCCATTGCCCAGAATCAGACGACCCTCGGTTGTTCCAGAGCCAAGCTGGGCA 796 

llilllllllllMMllMlllllMIIIIIIIMtlllllMlllilillllMI 

TCACCCCCATTGCCCAGAATCAGACGACCCTCGGTTCTTCCAGAGCCAAGCTGGGCA 780 
TCCCCTGGCAAGCCTTCACCAGTATCCACGGCCGTGGGGGCGGGGCCCTGCTGGGGG 856 

MIIIIMMMMMMIIMIIIIIMIIMIIIIIIIIIIMMMMMMII 

TCCCCTGGCAAGCCTTCACCAGTATCCACGGCCGTGGGGGCGGGGCCCTGCTGGGGG 840 
GATGGATCCTCACTGCTGCCCACACCATCTACCCCAAGGACAGTGTTTCTCTCAGGA 916 

20 " 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 M 1 1 i 1 1 1 1 1 1 II I 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 

\GATGGATCCTCACTGCTGCCCACACCGTCTACCCCAAGGACAGTGTTTCTCTCAGGA 900 
^CCAGAGTGTGAATGTGTTCTTGGGCCACACAGCCATAGATGAGATGCTGAAACTGG 976 

MIIIIIIIIIMIIMMIIIIIIIIIMIIMMMIIIIIIIMMIMIIMII 

^ACCAGAGTGTGAATGTGTTCTTGGGCCACACAGCCATAGATGAGATGCTGAAACTGG 960 
QACCACCCTGTCCACCGTGTCGTTGTGCACCCCGACTACCGTCAGAATGAGTCCCATA 1 0 3 1 

MMIIIMMIIIMIIMMIIIMMIMIMIIIIMIIMMIMIIIIIIII 

^ACCACCCTGTCCACCGTGTCGTTGTGCACCCCGACTACCGTCAGAATGAGTCCCATA 102 " 
ITTAGCGGGGACATCGCCCTCCTGGAGCTGCAGCACAGCATCCCCCTGGGCCCCAACG 109 - 

f l i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 i 1 1 1 1 1 1 1 i 1 1 1 1 1 { 1 1 1 1 1 1 

TTTAGCGGGGACATCGCCCTGCTGGAGCTGCAGCACAGCATCCCCCTGGGCCCCAACG 108 
CTCCCGGTCTGTCTGCCCGATAATGAGACCCTCTACCGCAGCGGCTTGTTGGGCTACG 115 

1 1 1 r 1 1 1 1 1 1 1 1 1 ! t E 1 1 i 1 1 1 1 1 r I i 1 1 1 1 1 1 1 1 f ! i 1 1 1 i 1 1 1 1 i 1 1 1 i 1 i 1 i 1 1 1 

CTCCCGGTCTGTCTGCCCGATAATGAGACCCTCTACCGCAGCGGCTTGTTGGGCTACG 114 
AGTGGGTTTGGCATGGAGATGGGCTGGCTAACTACTGAGCTGAAGTACTCGAGGCTGC 121 

40 " 1 1 1 1 1 1 1 1 II 1 1 1 1 II I M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 M 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

^GTGGGTTTGGCATGGAGATGGGCTGGCTAACTACTGAGCTGAAGTACTCGAGGCTGC 120 
CGTAGCTCCCAGGGAGGCCTGCAACGCCTGGCTCCAAAAGAGACAGAGACCCGAGGTGT 127 

IMIIIIIIIIIIIIMMIIIIIIIMIIMIIIIIMMIMIIMIIIIIIIIMI 

rGTAGCTCCCAGGGAGGCCTGCAACGCCTGGCTCCAAAAGAGACAGAGACCCGAGGTGT 126 
FTCTGACAATATGTTCTGTGTTGGGGATGAGACGCAAAGGCACAGTGTCTGCCAGGGGG 133 

IIIIIIIIMIIIII1IMIMMIIMMIIIIMIIMIIIIIIIIIMIMIIIII 

rTCTGACAATATGTTCTGTGTTGGGGATGAGACGCAAAGGCACAGTGTCTGCCAGGGGG 132 
HAGTGGCAGCCTCTATGTGGTATGGGACAATCATGCCCATCACTGGGTGGGCACGGGCA 13 9 

IM1IIIIII IIIIIIIIIMIIIIIIIilllllllllllMIIIIIMIMIIIIII 

^AGTGG CAGCGT CTATGTGGTATGGGACAATCATGC C CATCACTGGGTGG C CACGGG CA 138 
rGTGTCCTGGGGCATAGGGTGTGGCGAAGGGTATGACTTCTACACCAAGGTGCTCAGCT 145 

IIIIIIMIIIIIIIIIIIllllMIIIMIMIIIIMIMIIIMMMIMMIII 

rGTGTCCTGGGGCATAGGGTGTGGCGAAGGGTATGACTTCTACACCAAGGTGCTCAGCT 144 
rGTGGACTGGATCAAGGGAGTGATGAATGGCAAGAATTGACCCTGGGGGCTTGAACAGG 151 

60 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

3TGGACTGGATCAAGGGAGTGATGAATGGCAAGAATTGACCCTGGGGGCTTGAACAGG 150 
^TGACCAGCACAGTGGAGGCCCCAGGCAACAGAGGGCCTGGAGTGAGGACTGAACACT 157 

IIMIIIIMIMIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

2TGACCAGCACAGTGGAGGCCCCAGGCAACAGAGGGCCTGGAGTGAGGACTGAACACT 156 
3GTAGGGGGTGGGGGT 1594 

MINIM IMMM

In this search it was also found that the FCTR5a nucleic acid had homology to three
fragments of Homo sapiens complement component 1, r subcomponent. It has 102 of 117 bases
(87%) identical to 1458-1574, 82 of 94 bases (87%) identical to 2052-2145, and 54 of 63 bases
(85%) identical to 1678-1740, all fragments of Homo sapiens complement component 1, r
subcomponent (GenBank Acc: NM_001733.1) (Table 5F).

Table 5F. BLASTN of FCTR5a against Homo sapiens complement component 1, r
subcomponent (SEQ ID NO:79)

>GI|4502492|REF|NM_001733.1| HOMO SAPIENS COMPLEMENT COMPONENT 1, R SUBCOMPONENT 
(C1R), MRNA 
LENGTH = 2386

SCORE = 113 BITS (57), EXPECT = 3E-22 
IDENTITIES = 102/117 (87%) 
STRAND = PLUS / PLUS 






QUERY * 783 AGCCAAGCTGGGCAACTTCCCCTGGCAAGCCTTCACCAGTATCCACGGCCGTGGGGGCGG 842 

lllllll MIMMIIMIIIIIIII I lllilll MINI II MINIM 

SBJCT: 14 58 AG C CAAGATGGG CAACT TC C C CTGG CAGGTGT TCACCAACATCCACGGGCG CGGGGGCGG 1517 



20 QUERY: 843 GGCCCTGCTGGGGGACAGATGGATCCTCACTGCTGCCCACACCATCTACCCCAAGGA 899 

MMIMIMM Ml I IMMMIMI IMIMIIIMI I II IMIMM 

SBJCT: 1518 GGCCCTGCTGGGCGACCGCTGGATCCTCACAGCTGCCCACACCCTGTATCCCAAGGA 1574 

SCORE = 91.7 BITS (46), EXPECT = 1E-15
IDENTITIES = 82/94 (87%) 
STRAND = PLUS / PLUS 

QUERY* 13 80 CTGGGTGGCCACGGGCATTGTGTCCTGGGGCATAGGGTGTGGCGAAGGGTATGACTTCTA 143 9 

30 | MMMM I III III II I MINIM Mill MM I II II Mil II Nil 

SBJCT: 2052 CTGGGTGGCCACGGGCATCGTGTCCTGGGGCATCGGGTGCAGCAGGGGCTATGGCTTCTA 2111 
QUERY : 144 0 CACCAAGGTGCTCAGCTATGTGGACTGGATCAAG 1473 

MINI lllllll III ! I! I Mil 1 1 II M 

35 SBJCT: 2112 CACCAAAGTGCTCAACTACGTGGACTGGATCAAG 2145 



SCORE = 54.0 BITS (27), EXPECT = 2E-04 
IDENTITIES = 54/63 (85%) 
STRAND = PLUS / PLUS






QUERY: 1006 CACCCCGACTACCGTCAGAATGAGTCCCATAACTTTAGCGGGGACATCGCCCTCCTGGAG 1065 

INN 1 1 1 1 M II II N Illlllll I II III IIIMIIIIIINI 

SBJCT: 1678 CACCCGGACTACCGTCAGGATGAGTCCTACAATTTTGAGGGGGACATCGCCCTGCTGGAG 1737 



QUERY: 1066 CTG 1068 
III 

SBJCT: 1738 CTG 174 0 



The amino acid sequence of the FCTR5a protein has 485 of 487 amino acid residues
(99%) identical to, and 487 of 487 residues (100%) positive with, the 487 amino acid
complement C1r-like proteinase precursor from Homo sapiens (GenBank-ACC: AAF44349.1)
(SEQ ID NO:80) (Table 5G).

Table 5G. BLASTP of FCTR5a and b against Complement C1R-like proteinase precursor
(SEQ ID NO:80)

>GI|7706083|REF|NP_057630.1| COMPLEMENT C1R-LIKE PROTEINASE PRECURSOR [HOMO SAPIENS]
GI|11436768|REF|XP_007061.1| COMPLEMENT C1R-LIKE PROTEINASE PRECURSOR [HOMO SAPIENS]
GI|7271475|GB|AAF44349.1|AF178985_1 (AF178985) COMPLEMENT C1R-LIKE PROTEINASE
PRECURSOR [HOMO SAPIENS]
LENGTH = 487

SCORE = 972 BITS (2513), EXPECT = 0.0
IDENTITIES = 485/487 (99%), POSITIVES = 487/487 (100%)

R 

QUERY: 1 MPGPRWGKYLWRSPHSKGCPGAMWWLLLWGVLQACPTRGSVLLAQELPQQLTSPGYPEP 60 

1 1 1 1 } I i i 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 i M 1 1 E 1 1 1 1 1 f 1 1 1 1 1 E 1 1 1 1 1 1 1 i 1 1 1 i 1 1 1 1 

SBJCT: 1 MPGPRWGKYLWRSPHSKGCPGAMWWLLLWGVLQACPTRGSVLLAQELPQQLTSPGYPEP 60 

15 

QUERY: 61 YGKGQESSTDIKAPEGFAVRLVFQDFDLEPSQDCAGDSVTISFVGSDPSQFCGQQGSPLG 12 0 

H M 1 1 M M 1 1 1 1 M 11 1 1 1 ! 1 1 1 1 i E ! i 1 1 1 11 1 M 1 II I M I i 1 M 1 1 1 1 1 1 1 1 1 1 1 

SBJCT: 61 YGKGQESSTDIKAPEGFAVRLVFQDFDLEPSQDCAGDSVTISFVGSDPSQFCGQQGSPLG 120 
20 QUERY: 121 RPPGQREFVSSGRSLRLTFRTQPSSENKTAHLHKGFLALYQTVAVNYSQPISEASRGSEA 180 

1 1 1 1 1 1 1 ! 1 1 f 1 1 1 1 1 i 1 1 1 E 1 1 M I M i ! 1 1 1 1 M 1 1 1 1 1 ! 1 1 1 [ 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 

SBJCT: 121 RPPGQREFVSSGRSLRLTFRTQPSSENKTAHLHKGFLALYQTVAVNYSQPISEASRGSEA 180 
QUERY: 181 INAPGDNPAKVQNHCQEPYYQAAAAGALTCATPGTWKDRQDGEEVLQCMPVCGRPVTPIA 240 

25 1 1 1 II II ! 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 11 1 1 1 1 1 

SBJCT: 181 INAPGDNPAKVQNHCQEPYYQAAAAGALTCATPGTWKDRQDGEEVLQCMPVCGRPVTPIA 240 
QUERY: 241 QNQTTLGSSRAKLGNFPWQAFTSIHGRGGGALLGDRWILTAAHTIYPKDSVSLRKNQSVN 3 00 

lillilllllMllllllllllllilMIIIMIIIIIIMIIMIilllllillllll 

30 SBJCT: 241 QNQTTLGSSRAKLGNFPWQAFTSIHGRGGGALLGDRWILTAAHTVYPKDSVSLRKNQSVN 300 

QUERY: 3 01 VFLGHTAIDEMLKLGNHPVHRVWHPDYRQNESHNFSGDIALLELQHSIPLGPNVLPVCL 360 

MMIIIIIIIMIIIIIMIMIIIIIMIIMIIMIIMIIMIIIIIIIMMIII 

SBJCT: 301 VFLGHTAIDEMLKLGNHPVHRVWHPDYRQNESHNFSGDIALLELQHSIPLGPNVLPVCL 360 

35 

QUERY: 361 PDNETLYRSGLLGYVSGFGMEMGWLTTELKYSRLPVAPREACNAWLQKRQRPEVFSDNMF 420 

1 1 1 1 1 i 1 1 1 J 1 1 1 1 1 M 1 i 1 1 1 i 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 i 1 1 1 

SBJCT: 361 PDNETLYRSGLLGYVSGFGMEMGWLTTELKYSRLPVAPREACNAWLQKRQRPEVFSDNMF 420 
40 QUERY: 421 CVGDETQRHSVCQGDSGSLYWWDNHAHHWVATGIVSWGIGCGEGYDFYTKVLSYVDWIK 4 80 

MlillllllMMIIIMIMIIMIIIIIMIIMMIIIIIMIIMIIIMIili 

SBJCT: 421 CVGDETQRHSVCQGDSGSVYVVWDNHAHHWVATGIVSWGIGCGEGYDFYTKVLSYVDWIK 480 

QUERY: 481 GVMNGKN 487 
45 | | | | | | | 

SBJCT: 481 GVMNGKN 4 87 

R = AT RESIDUE 46, FCTR5B DIFFERS FROM FCTR5A IN THAT Q46R. THE REST OF THE HOMOLOGY
IS THE SAME.


The full amino acid sequence of the FCTR5a protein has 175 of 303 amino acid
residues (58%) identical to, and 226 of 303 residues (74%) positive with, the 400-701 amino acid
segment; 72 of 157 residues (45%) identical and 94 of 157 residues (59%) positive with amino
acids 1-155; and 36 of 139 residues (25%) identical and 58 of 139 residues (40%) positive with
amino acids 188-312 of the 705 amino acid Complement C1R Component Precursor from Homo
sapiens (GenBank-ACC: AAA51851.1) (SEQ ID NO:43) (Table 5H).

Table 5H. BLASTP of FCTR5a and b against Complement C1R Component Precursor
(SEQ ID NO:81)

>GI|115204|SP|P00736|C1R_HUMAN COMPLEMENT C1R COMPONENT PRECURSOR
GI|67614|PIR||C1HURB COMPLEMENT SUBCOMPONENT C1R (EC 3.4.21.41) PRECURSOR - HUMAN
GI|179644|GB|AAA51851.1| (M14058) HUMAN COMPLEMENT C1R [HOMO SAPIENS]
LENGTH = 705

SCORE = 361 BITS (928), EXPECT = 8E-99
IDENTITIES = 175/303 (58%), POSITIVES = 226/303 (74%), GAPS = 9/303 (2%)

QUERY: 189 AKVQNHCQE P YYQ AAAAGALTCATPGTWKDRQDGEEVLQCMPVCGRPVTPIA 24 0 

| ++ | +| III I 11+ I M++ + I + IIII + M 1 + 

SBJCT: 40 0 ARIQYYCHEPYYKMQTRAGSRESEQGVYTCTAQGIWKNEQKGEKIPRCLPVCGKPVNPVE 459 



15 



QUERY: 241 QNQ TT LGS S RAKLGNF P WQAFT S I HGRGGGAL LGDRWI LTAAHT I YP KD S VS LRKNQ S VN 3 00 

I | +| + II + IMMI M + MMIMMIMMIIMIM + IM + ++11++ 

SBJCT: 460 QRQRI IGGQKAKMGNFPWQVFTNIHGRGGGALLGDRWILTAAHTLYPKEHEA- QSNASLD 518 



20 QUERY: 3 01 VFLGHTAIDEMLKLGNHPVHRVWHPDYRQNESHNFSGDIALLELQHS I PLGPNVLPVCL 3 60 

MINI ++I++IIIIII+ 1 1 MIIIII + II + M IIIIIMI++I+ llll + M + M 

SBJCT: 519 VFLGHTNVEELMKLGNHP IRRVS VHPDYRQDES YNFEGD I ALLELENS VTLGPNLLP I CL 578 

QUERY : 361 PDNETLYRSGLLGYVSGFGMEMGWLTTELKYSRLPVAPREACNAWLQKRQRPEVFSDNMF 420 

25 Ml+I I II+IIIMII+ + +I++ IMII +11 11+ + I +111 III 

SBJCT: 579 PDNDTFYDLGLMGYVSGFGVMEEKIAHDLRFVRLPVANPQACENWLRGKNRMDVFSQNMF 63 8 

QUERY- 421 CVGDETQRHSVCQGDSGSLYWWDNHAHHWVATGIVSWGIGCGEGYDFYTKVLSYVDWIK 480 

i 1 + + 1 1 1 1 1 1 ++ I I + IMIIMIIIIM II IIIMI + IIMM 

30 SBJCT: 639 CAGHPSLKQDACQGDSGGVFAVRDPNTDRWVATGIVSWGIGCSRGYGFYTKVLNYVDWIK 698 

QUERY: 481 GVM 483 
I 

SBJCT: 699 KEM 701 

35 

SCORE = 122 BITS (306), EXPECT = 1E-26 
IDENTITIES = 72/157 (45%), POSITIVES = 94/157 (59%), GAPS =* 3/157 (1%) 

R 

40 QUERY: 24 MWWLLLWGVLQACPTRGSVLLAQELPQQLTSPGYPEPYGKGQESSTDIKAPEGFAVRLVF 83 

|| | | | 11+ + l+l ++III +1+11 I++I I I 1+ l+IM 

SBJCT: 1 MWLLYLLVPALFCRAGGSIPIPQKLFGEVTSPLFPKPYPNNFETTTVITVPTGYRVKLVF 60 

QUERY: 84 QDFDLEPSQDCAGDSVTISFVGSDPSQFCGQQGSPLGRPPGQREFVSSGRSLRLTFRTQP 143 

45 | ||||i|+ | | | || +MM Mill III++II + I I + Ml I 

SBJCT: 61 QQFDLEPSEGCFYDYVKISADKKSLGRFCGQLGSPLGNPPGKKEFMSQGNKMLLTFHTDF 120 

QUERY: 144 S S - ENKTAHLHKGFLALYQTVAVNYSQP I SEASRGSE 179 

1+ 11 I +IMM 11 11+ + I + 1 I 
50 SBJCT: 121 SNEENGTIMFYKGFLAYYQ--AVDLDECASRSKSGEE 155 

SCORE = 36.3 BITS (83), EXPECT = 0.93 
IDENTITIES = 36/139 (25%), POSITIVES = 58/139 (40%), GAPS = 17/139 (12%) 
55 R 

QUERY: 3 5 ACPTRGSVLLAQELPQQLTSPGYPEPYGKGQESSTDIKAPEGFAVRLVF-QDFDLEPSQD 93 

+| III ++I M I + 1+ I + I I + II++ I 

SBJCT: 188 SCQAECSSELYTEASGYISSLEYPRSYPPDLRCNYSIRVERGLTLHLKFLEPFDIDDHQQ 247 

60 QUERY: 94 - -CAGDSVTISFVGSDPSQFCGQQGSPLGRPPGQREFVSSGRSLRLTFRTQPSSENKTAH 151 

I I + I 1 + +111+1 Ml + +1 ++ I I I I ++ 
SBJCT: 24 8 VHCPYDQLQIYANGKNIGEFCGKQ RPP DLDTSSNAVDLLFFTDESGDS 295 

QUERY: 152 LHKGFLALYQTVAVNYSQP 170 
65 + | + | | + || 

SBJCT: 296 - -RGWKLRYTTEI IKCPQP 312 
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R = AT RESIDUE 46, FCTR5B DIFFERS FROM FCTR5A IN THAT Q46R. THE REST OF THE HOMOLOGY 
IS THE SAME. 
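
The substitution notation used in the note above (Q46R: glutamine at residue 46 replaced by arginine) can be illustrated with a short sketch. The helper name `apply_substitution` is hypothetical, and the fragment is the start of the query row for residues 35-93 of Table 5H, copied as printed; it is an illustration of the notation only, not a statement of the full FCTR5a or FCTR5b sequence.

```python
import re


def apply_substitution(fragment: str, change: str, first_residue: int = 1) -> str:
    """Apply a point substitution such as 'Q46R' to a protein fragment.

    `first_residue` is the 1-based position, in the full protein, of the
    first letter of `fragment` (an assumption supplied by the caller).
    """
    m = re.fullmatch(r"([A-Z])(\d+)([A-Z])", change)
    if m is None:
        raise ValueError(f"unrecognised substitution: {change!r}")
    old, pos, new = m.group(1), int(m.group(2)), m.group(3)
    idx = pos - first_residue
    if not 0 <= idx < len(fragment):
        raise IndexError(f"residue {pos} lies outside the fragment")
    if fragment[idx] != old:
        raise ValueError(f"expected {old} at residue {pos}, found {fragment[idx]}")
    return fragment[:idx] + new + fragment[idx + 1:]


# Residues 35-54 of the query row in Table 5H, as printed.
fragment_5a = "ACPTRGSVLLAQELPQQLTS"
fragment_5b = apply_substitution(fragment_5a, "Q46R", first_residue=35)
print(fragment_5b)
```

The check on the original residue makes the helper fail loudly if the stated position and the supplied fragment are out of register.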

Based upon homology, FCTR5 proteins and each homologous protein or peptide may 
share at least some activity. 



FCTR6 

The novel nucleic acid of 1078 nucleotides FCTR6a (also designated 27455183.0.19) 
encoding a novel human blood coagulation factor XI-like protein is shown in Table 6A. An 
ORF was identified beginning with an ATG initiation codon at nucleotides 243-245 and ending 
with a TAA codon at nucleotides 1044-1046. A putative untranslated region upstream from the 
initiation codon and downstream from the termination codon is underlined in Table 6A, and the 
start and stop codons are in bold letters. 

Table 6A. FCTR6a Nucleotide Sequence (SEQ ID NO:20) 

TTGATCCGTGCCAAGTGGCTTTTTGTGGGCTCTGTAGAGTGCTCTAAACCCAQCTCGGCCTTTGCTGTATTAGACAGAAGCACCTC 
15 ATTCATATCCCTGGGGCCCCTGATGGTGCAGTGGTCTGGCTGTGGTCTGCACACCAGCTATTCTGTTTT GTTTTGTTTTGTTTTTT 
TCCTACCTTTTTCCAATCCTCACACCTTCTGATCAACAGCCCCAGTAGGGTTTAAAGGTCCTAGAGCTACA TGGGATTTAGGTTTG 
TGGGCACAGCCAATTCTGCCACTTTTGAGACTTCCCTTCCCCTTCCACTTGCCCCTCTCTGGTTCTCTGCCACCAGTCCAGAAGAA 
CTGAGTGTCGTGCTGGGGACCAACGACTTAACTAGCCCATCCATGG 

TAAGAGAGCCAACATGGACAATGACATTGCCTTGCTGCTC 
20 TCCCC^CGCAGCCCGGCCCTGCCACATGGCGCGAATGCTGGGTGGCAGGTTGGGGCCAGACCAATGCTGCTGACAAAAACTCTGTG 
AAAACGGATCTGATGAAAGTGCCAATGGTCATCATGGACTGGGAGGAGTGTTCAAAGATGTTTCCAAAACTTACCAA 
GTGTGCCGGATACAAGAATGAGAGCTATGATGCCTGCAAGGGTGACAGTGGGGGGCCTCTGGTCTGCACCCCAGAGCCTGGTGAGA 
AGTGGTACCAGGTGGGCATCATCAGCTGGGGAAAGAGCTGTGGAGATAAGAA 

AAC CT CTGGAT CGAGAAAGTGAC C CAGCTAGGAGGCAGG C CCTT CAATG CAGAGAAAAGGAGGAC TT CTGT CAAACAGAAAC CTAT 
25 GGGCTCCCCAGTCTCGGGAGTCCCAGAGCCAGGCAGCCCCAGATCCTGGCTCCTGCTCTGTCCCCTGTCCCATGTGTTGTTCAGAG 
CTAT TTTGTAC TG ATAATAAAATAGAGGC TATT CTT T GAAC CGAAA 

The FCTR6a protein encoded by SEQ ID NO:20 has 267 amino acid residues and is 
presented using the one-letter code in Table 6B. FCTR6a was searched against other databases 
using SignalPep and PSort search protocols. The FCTR6a protein is most likely mitochondrial 
matrix space (Certainty = 0.4372) and seems to have no N-terminal signal sequence. The 
predicted molecular weight of FCTR6a protein is 29412.8 daltons. 
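
The stated protein length can be cross-checked against the ORF coordinates. The sketch below is a minimal illustration, assuming 1-based inclusive coordinates in which the ORF runs from the first base of the ATG codon to the last base of the TAA codon; the function name `orf_protein_length` is hypothetical. The FCTR6b coordinates (499 to 1302) are those given for Table 6C.

```python
def orf_protein_length(start_first_nt: int, stop_last_nt: int) -> int:
    """Residues encoded by an ORF given 1-based, inclusive coordinates of the
    first base of the start codon and the last base of the stop codon."""
    span = stop_last_nt - start_first_nt + 1
    if span <= 0 or span % 3 != 0:
        raise ValueError("ORF span is not a whole number of codons")
    # The stop codon is counted in the span but encodes no residue.
    return span // 3 - 1


fctr6a_len = orf_protein_length(243, 1046)   # ATG at 243-245, TAA at 1044-1046
fctr6b_len = orf_protein_length(499, 1302)   # ATG at 499-501, TAA at 1300-1302
print(fctr6a_len, fctr6b_len)
```

Both coordinate pairs span 804 nucleotides, that is 268 codons including the stop, which agrees with the 267 residues reported for each protein.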

Table 6B. Encoded FCTR6a protein sequence (SEQ ID NO:21). 

MGFRFLGTANSATFETSLPLPLAPLWFSATSPEELSWLG 
35 KVPICLPTQPGPATWRECOTAGWGQTNAADKNSVKTD 

PEPGEKWYQVGIISWGKSCGDKNTPGIYTSLWYNLWIEKVTQLGGRPFNAEKRRTSVKQKPMGSPVSGVPEPGSPRSWLLLCPLS 

HVL FRAIL Y 

In an alternative embodiment, FCTR6b (alternatively referred to as 27455183.0.145) has 
the 1334 nucleotide sequence shown in Table 6C. An ORF was identified beginning with an ATG 
initiation codon at nucleotides 499-501 and ending with a TAA codon at nucleotides 1300-1302. 
A putative untranslated region upstream from the initiation codon and downstream from the 
termination codon is underlined in Table 6C, and the start and stop codons are in bold letters. 

Table 6C. FCTR6b Nucleotide Sequence (SEQ ID NO:22) 

GATTTTAGj ^ GGTTj ^ 

CGTGGGATATCAAATATCCTCTGGGGTTCGGAATGYGGGCTTATTACTGAAGATCCTGTCTGCTTGGTCAGTGGCAGGTC 
TAGACTAACTTCTGGTCCTGAGTTTCTAAAGTGCTGGTAGACCAGTTGATACAAAACAGATATAA TAATGAATGCCTTAT 
CTATCTGAAGGTCAGTTTGATCCGTGCCAAGTGGCTTTTTGTGGGCTGTGTAGAGTGCTCTAAAC CCAGCTCGGCCTTTG 
CTGTATTAGACAGAAGCACCTCATTCATATCCCTGGGGCCCCTGATGGTGC^^ 

TATTCTGTTTTGTTTTGTTTTGTTTTGTTTTTTCCTACCTTTTTCCAATCCTCACACCTTCTGATCAAC AGCCCCAGTAG 
GGTTTAAAGGTCCTAGAGCTACA TGGGATTTAGGTTTCTGGGCACAGCCAATTCTGCCACTTTTGAGACTTCCCTTCCCC 
TTCCACTTGCCCCTCTCTGGTTCTCTGCCACCAGTCCAGAAGAACTGAGTGTCGTGCTGGGGACCAACGACTTAACTAGC 
CCATCCATGGAAATAAAGGAGGTCGCCAGCATCATTCTTCACAAAGACTTTA 

CTTGCTGCTGCTGGCTTCGCCCATCAAGCTCGATGACCTGAAGGTGCCCATCTGCCTCCCCACGCAGCCCGGCCCTGCCA 

CATGGCGCGAATGCTGGGTGGCAGGTTGGGGCCAGACCAATGCTGCTGACAAAAACTCTGTGAAAACGGATCTGATGAAA 

GTGCCAATGGTCATCATGGACTGGGAGGAGTGTTCAAAGATGTTTCCAAAACTTACCAAAAATATGCTGTGTGGCG 

CAAGAATGAGAGCTATGATGCCTGCAAGGGTGACAGTGGGGGGCCTCTGGTCTGCACCCCAGAGCCTGGTGAGAAGTGGT 

ACCAGGTGGGCATCATCAGCTGGGGAAAGAGCTGTGGAGAGAAGAACACCCCAGGGATATACACCTCGTTGGTGAACTAC 

AACCTCTGGATCGAGAAAGTGACCCAGCTAGAGGGCAGGCCCTTCAATGCAGAGAAAAGGAGGACTTCTGTCAAACAGM 

ACCTATGGGCTCCCCAGTCTCGGGAGTCCCAGAGCCAGGCAGCCCCAGATCCTGGCTCCTGCTCTGTCCCCTGTCCCATG 

TCTTnTTrAGA^CTATTTTGTACTG ATAATAAAATAGAGGCTATTCTTTCAACCGAAA 

The FCTR6b protein encoded by SEQ ID NO:22 has 267 amino acid residues and is 
presented using the one-letter code in Table 6D. The Psort profile for FCTR6b predicts that this 
sequence has no N-terminal signal peptide and is likely to be localized at the mitochondrial 
matrix space (Certainty = 0.4372). The predicted molecular weight of this protein is 29498.9 
Daltons. 

Table 6D. Encoded FCTR6b protein sequence (SEQ ID NO:23). 

MGFRFLGTANSATFETS LP LPLAPLWFS ATS PEELS W 
KVP I CLPTQPGPATWRE COTAGWGQTNAADKNS VKTDLMCT^ 

PEPGEKWYQVGIISWGKSCGEKNTPGIYTSLWYNLWIEKVTQLEGRPFNAEKRRTSVKQKP^3GSPVSGVPEPGSPRSWLLLCPLS 
HVLFRAILY 

In a search of sequence databases, it was found, for example, that the FCTR6a nucleic 
acid sequence has 853 of 897 bases (95%) identical to bases 551-1447, and 346 of 388 bases 
(89%) identical to bases 127-513 of Macaca fascicularis brain cDNA, clone QccE-17034 
(GENBANK-ID: AB046651) (Table 6E). 

Table 6E. BLASTN of FCTR6a against Macaca fascicularis brain cDNA, clone QccE-17034 

(SEQ ID NO:82) 

>GI|9651112|DBJ|AB046651.1|AB046651 MACACA FASCICULARIS BRAIN CDNA, CLONE QCCE-17034 
LENGTH = 1746 

SCORE = 1429 BITS (721), EXPECT = 0.0 
IDENTITIES = 853/897 (95%) 
STRAND = PLUS / PLUS 

QUERY * 434 CCTTTTTCCAATCCTCACACCTTCTGATCAACAGCCCCAGTAGGGTTTAAAGGTCCTAGA 4 93 

llllilllMMIIIIIIIIHIMM I IMIIMMIIIIIII 1 1 1 1 MINIM 

SBJCT: 551 CCTTTTTCCAATCCTCACACCTTCTGAGCTACAGCCCCAGTAGGGTCTAAATGTCCTAGA 610 
QUERY - 494 GCTACATGGGATTTAGGTTTCTGGGCACAGCCAATTCTGCCACTTTTGAGACTTCCCTTC 553 

Ml Ml MMMMMMM MMMMMMM MMMMMI MMMM1 

SBJCT : 611 GCTATATGAGATTTAGGTTTCTGAGCACAGCCAATTCTCCCACTTTTGAGGCTTCCCTTC 670 
QUERY- 554 CCCTTCCACTTGCCCCTCTCTGGTTCTCTGCCACCAGTCCAGAAGAACTGAGTGTCGTGC 613 

llll! 1 1 II lillMIIIMIMIMMIIIMIIIIMMIIIIIIII 
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SBJCT: 671 CCCTTTCACTCGCCCCTCTCTGGTTCTCTGCCACCAGTCCAGAAGAACTGAATGTCGTGC 730 
JGGGACCAACGACTTAACTAGCCCATCCATGGAAATAAAGGAGGTCGCCAGCATCATTC 673 

1 1 1 M Mil Illllll Mil I MMM1MMIIIM IIIMMMM MMMIM 

K3GGACCAACGACTTAACTAGCTCATCCATGGAAATAAAGGAGGTCGCCAGCATCATTC 790 
^CACAAAGACTTTAAGAGAGCCAACATGGACAATGACATTGCCTTGCTGCTGCTGGCTT 733 

MM IIIIIIMIMIIIIIIMIMIIIIIIIIIMIIIIIIMIIMMIII I 

i-CACAAGGACTTTAAGAGAGCCAACATGGACAATGACATTGCCTTGCTGCTGCTGGCCT 850 

3CCCATCAAGCTCGATGACCTGAAGGTGCCCATCTGCCTCCCCACGCAGCCCGGCCCTG 793 

llllllll I i I I > I M I I M M ' ' I I 1 1 i li , I 1 I 1 I I I Illllll llllll I 
3CCCATCACACTCGATGACCTGAAGGTGCCCATCTGGCTCCCTACGCAGCACGGCCCCG 910 

^CATGGCGCGAATGCTGGGTGGCAGGTTGGGGCCAGACCAATGCTGCTGACAAAAACT 853 

llllllll 1 1 i 1 1 1 M I M 1 1 1 1 1 1 1 1 1 M 1 1 M I ! I ! II 1 1; M 1 1 1 1 M 1 1 1 ! I M 

^ACATGGCACGAATGCTGGGTGGCAGGTTGGGGCCAGACCAATGCTGCTGACAAAAACT 970 
rGTGAAAACGGATCTGATGAAAGTGCCAATGGTCATCATGGACTGGGAGGAGTGTTCAA 913 

20 ' 1 1 II I II I MM I II Mill MM III lllllllll Illllll MM MM llllll I 

3TGAAAACGGATCTGATGAAAGCGCCGATGGTCATCATGGACTGGGAGGAGTGTTCAA 103 
^TGTTTCCAAAACTTACCAAAAATATGCTGTGTGCCGGATACAAGAATGAGAGCTATG 973 

llllllllllll MMIIIIIMMMIMM llllllll lllllllllllll 

3CGTTTCCAAAACTCACCAAAAATATGCTGTGTGCTGGATACAATAATGAGAGCTATG 109 
3CCTGCAAGGGTGACAGTGGGGGGCCTCTGGTCTGCACCCCAGAGCCTGGTGAGAAGT 103 

llllll MMMIM Mill MMMMMMIMMIIMMMMMIMM 

GCCTGCCAGGGTGACAGCGGGGGACCTCTGGTCTGCACCCCAGAGCCTGGTGAGAAGT 115 
TACCAGGTGGGCATCATCAGCTGGGGAAAGAGCTGTGGAGAGAAGAACACCCCAGGGA 109 

IIIMMMM MMMMMMMMMIIMMMMMMMMMIMIMM 



[Row labels and start positions displaced from the truncated alignment rows of this table, in order: SBJCT 671; QUERY 614, SBJCT 731; QUERY 674, SBJCT 791; QUERY 734, SBJCT 851; QUERY 794, SBJCT 911; QUERY 854, SBJCT 971; QUERY 914, SBJCT 1031; QUERY 974, SBJCT 1091; QUERY 1034, SBJCT 1151; QUERY 1094, SBJCT 1211; QUERY 1154, SBJCT 1271; QUERY 1214, SBJCT 1331; QUERY 1274, SBJCT 1391.] 



1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i I i 1 1 1 1 1 1 1 1 1 1 1 1 1 IMIIIMI 

rATACACCTCGTTGGTGAACTACAACCTCTGGATCGAGAAGGTGACCCAGi 

jG c cc tt caatg cagagaaaaggaggactt c tgt caaacagaaac ct atg< 
I II M 1 1 1 Ml Illllll MMM IIMIIIIIIIIIMIMMI 

SGCCCTTCAGTGCGGAGAAAATGAGGACCTCTGTCAAACAGAAACCTATGi 

rCTCGGGAGTCCCAGAGCCAGGCAGCCCCAGATCCTGGCTCCTGCTCTGT 

Illllll lllllllllllllll III IIIMIMIIIIIMIIIIIM 
rCTCGGGGGTCCCAGAGCCAGGCGGCCTCAGATCCTGGCTCCTGCTCTGT 

VTGTGTTGTTCAGAGCTATTTTGTACTGATAATAAAATAGAGGCTATTCT 

MIIIMIMIIMMMIIIIIMMIIIIIIMIMMMIIIIM I 



SCORE = 428 BITS (216), EXPECT = E-117 

IDENTITIES = 346/388 (89%), GAPS = 1/388 (0%) 
STRAND = PLUS / PLUS 

QUERY: 1 GATTTTAGAAGGTTAATCAAAAACCCGGGGACAGTTTCTTCATGGCATAACCACAGACCT 60 

55 llllllllllll III Illllll llll IMIIIMI I Mill llllllll Mill 

T TTTAGAAGGTTAATCAAAAAC C CAAGGACAGTTT CAT CATGTCATAAC CAAAGACCC 186 

GTGGCACCCGCTGTCGTGGGATATCAAATATCCTCTGGGGTTCGGAATGTGGGCTTAT 120 

IIMIIII llllll IMIMI llllllll I llllllll I t I I I i i I Mill 
GTGGCACCTGCTGTCATGGGATAACAAATATCTTGTGGGGTTCTGAATGTGGACTTAT 246 

CTGAAGATCCTGTCTGCTTGGTCAGTGGCAGGTCTAGACTAACTTCTGGTCCTGAGTT 18 0 

llllll I i 1 1 1 1 1 1 1 1 i 1 1 E r I E 1 1 1 1 IMIIMIIIIIIIIIMIIIIIIIM 

CTGAAGCTCCTGTCTGCTTGGTCAGTGG- TGGTCTAGACTAACTTCTGGTCCTGAGAT 305 
TAAAGTGCTGGTAGACCAGTTGATACAAAACAGATATAATAATGAATGCCTTATCTAT 240 

Illllll MMMIM Mill I MM I 1 1 1 1 M 1 1 M M M 1 1 1 M I llll 

TAAAGTGTTGGTAGACCGGTTGAGATAAAAGATATATAATAATGAATGCCTTACCTAT 3 65 

GAAGGTCAGTTTGATCCGTGCCAAGTGGCTTTTTGTGGGCTGTGTAGAGTGCTCTAAA 300 

[Row labels and start positions displaced from the truncated alignment rows above, in order: QUERY 1, SBJCT 127; QUERY 61, SBJCT 187; QUERY 121, SBJCT 247; QUERY 181, SBJCT 306; QUERY 241.] 



Mill IIIIIIIIIMIIIIIIII IIIIIMIIIIIIII II Mill 

SBJCT: 366 CTGAAAACCAGTTTGATCCGTGCCAAGGGGCTTTTTGTGGGCTCTGTAGAGTGCCCTAAA 425 
QUERY: 301 CCCAGCTCGGCCTTTGCTGTATTAGACAGAAGCACCTCATTCATATCCCTGGGGCCCCTG 3 60 

I'll IIMMIMM MMIMIIIMM II M M III Mill Ml 

SBJCT: 426 CCCAGCTCTGCCTTTGCTGTGTTAGACAGAAGCACGCCATTCACATCTCTGGGGCCCCCA 4 85 
QUERY: 361 ATGGTGCAGTGGTCTGGCTGTGGTCTGC 388 

lllllil MM Ml II II II 1 1 II 

SBJCT: 486 ATGGTGCCATGGTGTGGTTGTGGTCTGC 513 

In a search of sequence databases, it was found, for example, that the FCTR6a nucleic 
acid sequence has 295 of 378 bases (78%) identical to bases 410-779 of Mus musculus adult 
male testis cDNA, RIKEN full-length enriched (GENBANK-ID:AK016601) (Table 6F). 

Table 6F. BLASTN of FCTR6a against Mus musculus adult male testis cDNA, RIKEN full-length 
enriched (SEQ ID NO:83) 

>GI|12855429|DBJ|AK016601.1|AK016601 MUS MUSCULUS ADULT MALE TESTIS CDNA, RIKEN FULL-LENGTH ENRICHED 
LIBRARY, CLONE:4933401F05, FULL INSERT SEQUENCE 
LENGTH = 1047 

SCORE = 97.6 BITS (49), EXPECT = 2E-17 
IDENTITIES = 295/378 (78%), GAPS = 8/378 (2%) 
STRAND = PLUS / PLUS 

QUERY:  697 AACATGGACAATGACATTGCCTTGCTGCTGCTGGCTTCGCCCATCAAGCTCGATGACCTG 756
            ||||||||||| |||||||||||| ||||||| ||   |||| | | | || |||| |||
SBJCT:  410 AACATGGACAACGACATTGCCTTGTTGCTGCTAGCCAAGCCCTTGACGTTCAATGAGCTG 469

QUERY:  757 AAGGTGCCCATCTGCCTCCCCACGCAGCCCGGCCCTGCCACATGGCGCGAATGCTGGGTG 816
            | ||||||||||||||| ||      ||||| |||| |||  |||| |||||||||||||
SBJCT:  470 ACGGTGCCCATCTGCCTTCCTCTCTGGCCCGCCCCTCCCAGCTGGCACGAATGCTGGGTG 529

QUERY:  817 GCAGGTTGGGGCCAGACCAATGCTGCTGACAAAAACTCTGTGAAAACGGATCTGATGAAA 876
            ||||| ||||||   |||||  |  |||||||  | ||| ||  |||||||||||||||
SBJCT:  530 GCAGGATGGGGCGTAACCAACTCAACTGACAAGGAATCTATGTCAACGGATCTGATGAAG 589

QUERY:  877 GTGCCAATGGTCATCATGGACTGGGAGGAGTGTTCAAAGATGTTTCCAAAACTTACCAAA 936
            ||||| |||   ||||| || |||||||| || | | |||||||||| |  || |||| |
SBJCT:  590 GTGCCCATGCGTATCATAGAGTGGGAGGAATGCTTACAGATGTTTCCCAGCCTCACCACA 649

QUERY:  937 AATATGCTGTGTGCCGGATACAAGAATGAGAGCTATGATGCCTGCAAGGGTGACAGTGGG 996
            || ||||||||||||  |||    ||||||||||| ||||| |||        |||||||
SBJCT:  650 AACATGCTGTGTGCCTCATATGGTAATGAGAGCTACGATGCTTGC--------CAGTGGG 701

QUERY:  997 GGGCCTCTGGTCTGCACCCCAGAGCCTGGTGAGAAGTGGTACCAGGTGGGCATCATCAGC 1056
            || || || ||||||||| |||| |||||    | |||||||||||||||||||||||||
SBJCT:  702 GGACCGCTTGTCTGCACCACAGATCCTGGCAGTAGGTGGTACCAGGTGGGCATCATCAGC 761

QUERY: 1057 TGGGGAAAGAGCTGTGGA 1074
            ||||| ||||||||||||
SBJCT:  762 TGGGGCAAGAGCTGTGGA 779
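
The identity bars of a BLASTN alignment follow directly from the two aligned rows, so a damaged match line can be recomputed rather than read. The sketch below is a minimal illustration using two row pairs from Table 6F (query 937-996 and query 1057-1074). The function names are hypothetical, and the placement of the eight gap columns in the subject row is an assumption based on the spacing of the printed row and on the reported GAPS = 8/378.

```python
def match_line(query: str, subject: str) -> str:
    """Return a BLASTN-style match line: '|' where the aligned bases are
    identical, a space for mismatches and gap columns."""
    if len(query) != len(subject):
        raise ValueError("aligned rows must have equal length")
    return "".join(
        "|" if q == s and q != "-" else " " for q, s in zip(query, subject)
    )


def identities(query: str, subject: str) -> int:
    """Count identical columns in a pair of aligned rows."""
    return match_line(query, subject).count("|")


# Query 937-996 against subject 650-701; gap columns written as '-'.
q_gapped = "AATATGCTGTGTGCCGGATACAAGAATGAGAGCTATGATGCCTGCAAGGGTGACAGTGGG"
s_gapped = "AACATGCTGTGTGCCTCATATGGTAATGAGAGCTACGATGCTTGC--------CAGTGGG"

# Query 1057-1074 against subject 762-779 (no gaps).
q_last = "TGGGGAAAGAGCTGTGGA"
s_last = "TGGGGCAAGAGCTGTGGA"

print(match_line(q_last, s_last))
print(identities(q_gapped, s_gapped), s_gapped.count("-"))
```

Summing the per-row identity counts over all rows of an alignment gives the numerator of the IDENTITIES field, which is a convenient consistency check on a transcribed table.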





The FCTR6a amino acid sequence has 247 of 267 amino acid residues (92%) identical to, and 251 
of 267 residues (94%) positive with, the 267 amino acid hypothetical protein [Macaca 
fascicularis] (GenBank: AB046651) (SEQ ID NO:84) (Table 6G). 






Table 6G. BLASTP of FCTR6a and b against hypothetical protein [Macaca fascicularis] 

(SEQ ID NO:84) 

>GI|9651113|DBJ|BAB03569.1| (AB046651) HYPOTHETICAL PROTEIN [MACACA FASCICULARIS] 
LENGTH = 267 

SCORE = 467 BITS (1202), EXPECT = E-131 

IDENTITIES = 247/267 (92%), POSITIVES = 251/267 (94%) 

QUERY: 1 MGFRFLGTANSATFETSLPLPLAPLWFSATSPEELSWLGTNDLTSPSMEIKEVASIILH 60 

I MM MM Ml MM M M M M M M M + l M M M M I MMMMMM! 

SBJCT: 1 MRFRFLSTANSPTFEASLPLSLAPLWFSATSPEELNWLGTNDLTS SSMEIKEVAS I ILH 60 
QUERY: 61 KDFKRANMDNDIALLLLASP I KLDDLKVP I CLPTQPGPATWRECWAGWGQTNAADKNSV 120 

IMMIMMMMMMM! i ! 1 1 1 1 1 1 i 1 1 i 1 MIM MMMMMMMMM 

SBJCT: 61 KDFKRANMDNDIALLLLASPITLDDLKVPICLPTQHGPATWHECWVAGWGQTNAADKNSV 120 
QUERY: 121 KTDLMKVPMVIMDWEECSKMFPKLTKNMLCAGYKNESYDACKGDSGGPLVCT 180 

MUM MMMMMM MMMMMMI I M M M + M M M M I M M M M I 

SBJCT: 121 KTDLMKAPiWIMDWEECSKAFPKLTKNMLCAGYNNESYDACQGDSGGPLVCTPEPGEKWY 180 

K E 
QUERY: 181 QVGI ISWGKSCGDKNTPGIYTSLVNYNLWIEKVTQLGGRPFNAEKRRTSVKQKPMGSPVS 240 

M M M M M M + M M M M M M M M M M M I M M + M I MIMMMM M 

SBJCT: 181 QVGIISWGKSCGEKNTPGIYTSLVNYNLWIEKVTQLEGRPFSAEKMRTSVKQKPMGSRVS 240 
QUERY: 241 GVPEPGS PRS WLLLCPLSHVLFRAI LY 267 

MMM! MMMMMMMMMI 

SBJCT: 241 GVPEPGGLRS WLLLCPLSHVLFRAI LY 267 

K AND E ARE RESIDUES THAT DIFFER BETWEEN FCTR6A AND B. D193K, AND G217E. 

The FCTR6a amino acid sequence has 80 of 201 amino acid residues (39%) identical to, and 119 
of 201 residues (58%) positive with, the 638 amino acid plasma kallikrein B1 precursor 
(GENBANK-ID:NP_000883.1) (SEQ ID NO:85) (Table 6H). 

Table 6H. BLASTP of FCTR6a and b against plasma kallikrein B1 precursor (SEQ ID NO:85) 

>GI|4504877|REF|NP_000883.1| PLASMA KALLIKREIN B1 PRECURSOR; KALLIKREIN, PLASMA; 
KALLIKREIN B PLASMA; KALLIKREIN 3, PLASMA; FLETCHER FACTOR [HOMO SAPIENS] 
GI|125184|SP|P03952|KAL_HUMAN PLASMA KALLIKREIN PRECURSOR (PLASMA PREKALLIKREIN) 
(KININOGENIN) (FLETCHER FACTOR) 
GI|67591|PIR||KQHUP PLASMA KALLIKREIN (EC 3.4.21.34) PRECURSOR - HUMAN 
GI|190263|GB|AAA60153.1| (M13143) PLASMA PREKALLIKREIN [HOMO SAPIENS] 
GI|8809781|GB|AAF79940.1| (AF232742) PLASMA KALLIKREIN PRECURSOR [HOMO SAPIENS] 
LENGTH = 638 

SCORE = 133 BITS (334), EXPECT = 3E-30 

IDENTITIES = 80/201 (39%), POSITIVES = 119/201 (58%), GAPS = 18/201 (8%) 
QUERY: 20 LPLAPLWFSATS PEELS WLGTNDLT - - S PSME I KE VAS I ILHKDFKRANMDND I ALLLL 77 

Ml +1 i +1 +1 + 1 +1 +111 M + I+++I + ++MM+ I 

SBJCT: 439 LPLQDVW RIYSGILNLSDITKDTPFSQIKE IIIHQNYKVSEGNHDIALIKL 489 

QUERY: 78 ASP I KLDDLKVP I CLPTQPGPAT-WRECWVAGWGQTNAADKNSVKTDLMKVPMVIMDWEE 136 

+1+ + + MIII++ +1 + Mi II) + +1 ++ I M + ++ M 
SBJCT: 490 QAPLNYTEFQKPICLPSKGDTSTIYTNCWVTGWGFSK- -EKGEIQNILQKVNIPLVTNEE 547 

K 

QUERY: 137 CSKMFP- - KLTKNMLCAGYKNESYDACKGDSGGPLVCTPEPGEKWYQVGI ISWGKSCGDK 194 
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I I * 1 + 1+ l + lllil IMIIIMIIIII + I III 111+ I + 

SBJCT: 548 CQKRYQDYKITQRMVCAGYKEGGKDACKGDSGGPLVC- - KHNGMWRLVGITSWGEGCARR 605 

QUERY : 195 NTPGIYTSLVNYNLWIEKVTQ 215 

ll+ll +111+11 
SBJCT: 606 EQPGVYTKVAEYMDWILEKTQ 626 

K IS A RESIDUE THAT DIFFERS BETWEEN FCTR6A AND B . D193K. 

The FCTR6a amino acid sequence has 73 of 183 amino acid residues (39%) identical to, and 110 
of 183 residues (59%) positive with, the 643 amino acid kallikrein [Sus scrofa] 
(GENBANK-ID:BAA37147.1) (SEQ ID NO:86) (Table 6I). 

Table 6I. BLASTP of FCTR6a and b against kallikrein [Sus scrofa] (SEQ ID NO:86) 

>GI|4165315|DBJ|BAA37147.1| (AB022425) KALLIKREIN [SUS SCROFA] 
LENGTH = 643 

SCORE = 128 BITS (322), EXPECT = 9E-29 

IDENTITIES = 73/183 (39%), POSITIVES = 110/183 (59%), GAPS = 12/183 (6%) 

QUERY: 38 VLGTNDLT - - S PSME I KE VAS 1 1 LHKDFKRANMDND I ALLLLAS P I KLDDLKVP I CLPTQ 95 

+1 +++I +1 ++II II+I+++I +IIIM 1+1+ I + IIMI++ 

SBJCT: 459 ILNI SEITKETPFSQVKE II IHQNYKILESGHD IALLKLETPLNYTDFQKP I CLPSR 515 

QUERY: 96 PGP-ATWRECWVAGWGQTNAADKNSVKTDLMKVPiWIMDWEECSKMFP- - KLTKNMLCAG 152 

+ Ml Ml I +1 ++ I II + ++ Ml I + I++I l + MI 
SBJCT: 516 DDTNWYTNCWVTGWGFTE- -EKGEIQNILQKVNIPLVSNEECQKSYRDHKISKQMICAG 573 

K 

QUERY: 153 YKNES YDACKGDSGGPLVCTPEPGEKWYQVG 1 1 SWGKS CGDKNTPGI YTSLVNYNLW IEK 212 

II IMM + IIIIIII + 1+ II 111+ I + ll + ll ++ I II + 
SBJCT: 574 YKEGGKDACKGESGGPLVC- -KYNGIWHLVGTTSWGEGCARREQPGVYTKVIEYMDWILE 631 

QUERY: 213 VTQ 215 

II 

SBJCT: 632 KTQ 634 

K IS A RESIDUE THAT DIFFERS BETWEEN FCTR6A AND B. D193K. 

The FCTR6a amino acid sequence has 81 of 205 amino acid residues (39%) identical to, and 112 
of 205 residues (54%) positive with, the 625 amino acid Coagulation factor XI [Homo sapiens] 
(embCAA64368 A) (SEQ ID NO:87) (Table 6J). 

Table 6J. BLASTP of FCTR6a and b against Coagulation factor XI [Homo sapiens] (SEQ ID NO:87) 

>GI|180352|GB|AAA51985.1| (M20218) COAGULATION FACTOR XI [HOMO SAPIENS] 
LENGTH = 625 

SCORE = 127 BITS (320), EXPECT = 1E-28 

IDENTITIES = 81/205 (39%), POSITIVES = 112/205 (54%), GAPS = 17/205 (8%) 
QUERY: 2 0 LPLAPLWFSATS PEELS WLGTNDLTS PSME IKE VAS 1 1 LHKDFKRANMDND I A 73 

I I ++ 11+ I I I + + MM I ll + l +1 I Ml 

SBJCT: 427 LTAAHCFYGVES PKI LRVYSG I LNQS EIKEDTSFFGVQEI IIHDQYKMAESGYDIA 482 

QUERY: 74 LLLLAS P I KLDDLKVP I CLPTQPG - PATWRECWVAGWGQTNAADKNS VKTDLMKVPMVIM 132 

II I + + I + IIMI++ + +111 III II ++ I I + ++ 

SBJCT: 4 83 LLKLETTVNYTDSQRPICLPSKGDRNVIYTDCWVTGWGYRKLRDK- - IQNTLQKAKIPLV 540 



QUERY: 133 DWEECSKMFP- -KLTKNMLCAG YKNES YDACKGDSGGPLVCTPEPGEKWYQVG I ISWGKS 190 
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Ml I + l + l l + lllh IIIIIIMIII I + 11+ Mi 111 + 

SBJCT: 541 TNEECQKRYRGHKITHKMICAGYREGGKDACKGDSGGPLSC- -KHNSVWHLVGITSWGEG 598 



QUERY: 191 CGDKNTPGIYTSLVNYNLWIEKVTQ 215 

5 | + | MM I II + II 

SBJCT : 599 CAQRERPGVYTNWE YVDW I LEKTQ 623 

K IS A RESIDUE THAT DIFFERS BETWEEN FCTR6A AND B. D193K. 


The number of new cases of renal cell carcinoma in the United States in 1996 was 
projected to be 30,600 with an estimated 12,000 deaths. Tumors with a proposed histogenesis 
from the proximal tubule (clear-cell and chromophilic tumors) amount to 85% of renal cancers, 
whereas tumors with a proposed histogenesis from the connecting tubule/collecting duct 
(chromophobic-, oncocytic-, and duct Bellini-type tumors) amount to only 11%. 

Adenocarcinomas may be separated into clear cell and granular cell carcinomas, although 
the 2 cell types may occur together in some tumors. The distinction between well-differentiated 
renal carcinomas and renal adenomas can be difficult. The diagnosis is usually made arbitrarily 
on the basis of size of the mass, but size alone should not influence the treatment approach, since 
metastases can occur with lesions as small as 0.5 centimeters. 

While radical nephrectomy with regional lymphadenectomy is the accepted, often 
curative therapy for stage I (localized disease) renal cell cancer, very little therapy is available 
for advanced disease, which represents about 70% of the patients. Radiotherapy as a postoperative 
adjuvant has not been effective, and when used preoperatively, may decrease local recurrence 
but does not appear to improve 5-yr survival. A chemotherapeutic agent capable of significantly 
altering the course of metastatic renal cell carcinoma has not been identified. (Renal Cell 
Cancer (PDQ®) Treatment - Health Professionals, Cancernet, NCI) 

There is therefore a need to identify genes that are differentially modulated in renal-cell 
carcinomas. In addition there is a need for methods to assay candidate therapeutic substances for 
modulating expression of these genes. These substances might be recombinant protein expressed 
by the identified genes or antibodies that bind to the identified proteins. There is yet additionally 
a need for an effective method of identifying target molecules or related components. These and 
related needs and defects are addressed in the present invention. 

Novel kallikrein-like/coagulation factor XI-like Proteins and Nucleic Acids Encoding Same 

FCTR6 is surprisingly found to be differentially expressed in clear cell renal cell 
carcinoma tissues versus the normal adjacent kidney tissues. The present invention discloses a novel 
protein encoded by a cDNA and/or by genomic DNA and proteins similar to it, namely, new 
proteins bearing sequence similarity to kallikrein-like proteins, nucleic acids that encode these proteins or 
fragments thereof, and antibodies that bind immunospecifically to a protein of the invention. It 
may have use as a therapeutic agent in the treatment of renal cancer and liver cirrhosis. 

The utility of kallikrein family members in protein therapy of Renal cancer 

The treatment of renal cell carcinoma with recombinant kallikrein could improve disease 
outcome through several potential mechanisms. The literature suggests that members of this 
protein family are inhibitory to the process of angiogenesis, a process of vital importance to 
tumor progression. Renal cell carcinoma is known to be a highly angiogenic cancer. Thus, 
treatment of renal cell carcinoma with kallikrein may effectively shut down the active recruitment 
of a blood supply to a tumor. Members of this protein family are known to play a role in 
vascular coagulation. Similar to anti-angiogenic therapy, a factor produced by cancer cells that 
is pro-coagulatory may also act to inhibit cancer growth by effectively "clogging" the tumor 
vascular supply. In addition, through its proteolytic activity, kallikrein may degrade ECM 
proteins or growth factors necessary for the progressive growth of cancer cells. Following is a 
relevant reference underlining the importance of kallikrein in cancer therapy. 

The New Human Kallikrein Gene Family: Implications in Carcinogenesis. 

Diamandis EP; Yousef GM; Luo I; Magklara I; Obiezu CV 

Department of Pathology and Laboratory Medicine, Mount Sinai Hospital, Toronto, 
Ontario, Canada. 

Trends Endocrinol Metab 2000 Mar;11(2):54-60. 

ABSTRACT: The traditional human kallikrein gene family consists of three genes, 
namely KLK1 [encoding human kallikrein 1 (hK1) or pancreatic/renal kallikrein], KLK2 
(encoding hK2, previously known as human glandular kallikrein 1) and KLK3 [encoding hK3 or 
prostate-specific antigen (PSA)]. KLK2 and KLK3 have important applications in prostate 
cancer diagnostics and, more recently, in breast cancer diagnostics. During 
the past two to three years, new putative members of the human kallikrein gene family 
have been identified, including the PRSSL1 gene [encoding normal epithelial cell-specific 1 
gene (NES1)], the gene encoding zyme/protease M/neurosin, the gene encoding prostase/KLK-L1, 
and the genes encoding neuropsin, stratum corneum chymotryptic enzyme and trypsin-like 
serine protease. Another five putative kallikrein genes, provisionally named KLK-L2, KLK-L3, 
KLK-L4, KLK-L5 and KLK-L6, have also been identified. Many of the newly identified 
kallikrein-like genes are regulated by steroid hormones, and a few kallikreins (NES1, protease 
M, PSA) are known to be downregulated in breast and possibly other cancers. NES1 appears to 
be a novel breast cancer tumor suppressor protein and PSA a potent inhibitor of angiogenesis. 
This brief review summarizes recent developments and possible applications of the newly 
defined and expanded human kallikrein gene locus. 



The utility of kallikrein-like/coagulation factor XI-like family members in protein 
therapy of liver cirrhosis 

Results related to inflammation shown below in Example A, Table CC3, panel 4, indicate 
over-expression of 27455183.0.19 in the liver cirrhosis sample, as compared to panel 1 data 
(Table CC1), where there is little or no expression in normal adult liver. Panel 4 was generated 
from various human cell lines that were untreated or resting as well as the same cells that were 
treated with a wide variety of immune modulatory molecules. There are several disease tissues 
represented as well as organ controls. 

Potential Role(s) of FCTR6 in Inflammation: 

Liver cirrhosis occurs in patients with hepatitis C and also in alcoholics. This protein is 
41% related to coagulation factor XI and its potential role in liver cirrhosis may be related to 
cleavage of kininogen. A reference for this follows: 

Thromb Haemost 2000 May;83(5):709-14 High molecular weight kininogen is cleaved 
by FXIa at three sites: Arg409-Arg410, Lys502-Thr503 and Lys325-Lys326. Mauron T, Lammle 
B, Wuillemin WA Central Hematology Laboratory, University of Bern, Inselspital, Switzerland. 

Abstract: 

We investigated the cleavage of high molecular weight kininogen (HK) by activated 
coagulation factor XI (FXIa) in vitro. Incubation of HK with FXIa resulted in the generation of 
cleavage products which were subjected to SDS-Page and analyzed by silverstaining, ligand-blotting 
and immunoblotting, respectively. Upon incubation with FXIa, bands were generated at 
111, 100, 88 kDa on nonreduced and at 76, 62 and 51 kDa on reduced gels. Amino acid sequence 
analysis of the reaction mixtures revealed three cleavage sites at Arg409-Arg410, at Lys502-Thr503 
and at Lys325-Lys326. Analysis of HK-samples incubated with FXIa for 3 min, 10 min 
and 120 min indicated HK to be cleaved first at Arg409-Arg410, followed by cleavage at 
Lys502-Thr503 and then at Lys325-Lys326. In conclusion, HK is cleaved by FXIa at three sites. 
Cleavage of HK by FXIa results in the loss of the surface binding site of HK, which may 
constitute a mechanism of inactivation of HK and of control of contact system activation. 

Impact of Therapeutic Targeting of FCTR6 in Inflammation: 

Therapeutic targeting of FCTR6 with a monoclonal antibody is anticipated to limit or 
block the extent of breakdown of kininogen and thereby reduce the degradation of liver that 
occurs in liver cirrhosis. A pertinent reference is: 

Thromb Haemost 1999 Nov;82(5):1428-32 Parallel reduction of plasma levels of high and low 
molecular weight kininogen in patients with cirrhosis. 

Cugno M, Scott CF, Salerno F, Lorenzano E, Muller-Esterl W, Agostoni A, Colman RW 
Department of Internal Medicine, IRCCS Maggiore Hospital, University of Milan, Italy. 
massimo.cugno@unimi.it 
Abstract: 

Little is known about the regulation of high-molecular-weight-kininogen (HK) and 
low-molecular-weight-kininogen (LK) or the relationship of each to the degree of liver function 
impairment in patients with cirrhosis. In this study, we evaluated HK and LK quantitatively by a 
recently described particle concentration fluorescence immunoassay (PCFIA) and qualitatively 
by SDS PAGE and immunoblotting analyses in plasma from 33 patients with cirrhosis 
presenting various degrees of impairment of liver function. Thirty-three healthy subjects served 
as normal controls. Patients with cirrhosis had significantly lower plasma levels of HK (median 
49 microg/ml [range 22-99 microg/ml]) and LK (58 microg/ml [15-100 microg/ml]) than normal 
subjects (HK 83 microg/ml [65-115 microg/ml]; LK 80 microg/ml [45-120 microg/ml]) 
(P<0.0001). The plasma concentrations of HK and LK were directly related to plasma levels of 
cholinesterase (P<0.0001) and albumin (P<0.0001 and P<0.001) and inversely to the Child-Pugh 
score (P<0.0001) and to prothrombin time ratio (P<0.0001) (reflecting the clinical and laboratory 
abnormalities in liver disease). Similar to normal individuals, in patients with cirrhosis, plasma 
HK and LK levels paralleled one another, suggesting that a coordinate regulation of those 
proteins persists in liver disease. SDS PAGE and immunoblotting analyses of kininogens in 
cirrhotic plasma showed a pattern similar to that observed in normal controls for LK (a single 
band at 66 kDa) with some lower molecular weight forms noted in cirrhotic plasma. A slight 
increase of cleavage of HK (a major band at 130 kDa and a faint but increased band at 107 kDa) 
was evident. The increased cleavage of HK was confirmed by the lower cleaved kininogen index 
(CKI), as compared to normal controls. These data suggest a defect in hepatic synthesis as well 
as increased destructive cleavage of both kininogens in plasma from patients with cirrhosis. The 
decrease of important regulatory proteins like kininogens may contribute to the imbalance in 
coagulation and fibrinolytic systems, which frequently occurs in cirrhotic patients. 

In summary, the differential expression of FCTR6 (Kallikrein family) in renal cell
carcinoma is an important finding that could be of considerable relevance to renal carcinogenesis.
In addition, overexpression of this gene in liver cirrhosis supports its anticipated use as an
immunotherapeutic target.



FCTR7 

The novel nucleic acid of 1498 nucleotides, FCTR7 (also designated 32592466.0.64),
encoding a novel trypsin inhibitor-like protein is shown in Table 7A. An ORF begins with an
ATG initiation codon at nucleotides 470-472 and ends with a TAA codon at nucleotides
1369-1371. Putative untranslated regions, if any, are found upstream from the initiation codon and
downstream from the termination codon.
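
The way an ORF is read from its ATG initiation codon can be sketched in a few lines of Python. This is only an illustrative sketch, not the method used to derive SEQ ID NO:25: the function name `translate_orf` is hypothetical, the standard genetic code is assumed, and the 36-nucleotide input is the stretch that begins at the ATG at nucleotide 470 as it appears in the query lines of the Table 7E alignment.

```python
# Standard genetic code, indexed by codon with bases in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate_orf(dna: str, start: int = 1) -> str:
    """Translate from the 1-based position `start` until a stop codon or the end.

    Hypothetical helper for illustration; it expects only A, C, G and T.
    """
    seq = dna.upper()
    protein = []
    for pos in range(start - 1, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":  # stop codon (TAA, TAG or TGA) ends the ORF
            break
        protein.append(residue)
    return "".join(protein)


# First 36 nucleotides of the FCTR7 ORF, taken from the Table 7E query lines.
fragment = "atgaagtgtaccgcgcgggagtggctcagagtaacc"
print(translate_orf(fragment))  # MKCTAREWLRVT
```

Applied to a full-length sequence, the same function would be called with `start=470` so that translation begins at the initiation codon named above.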


Table 7A. FCTR7 Nucleotide Sequence (SEQ ID NO:24) 

AGGCGCCTGGTTCTGCGCGTACTGGCTGTACGGAGCAGGAGCAAGAGGTCGCCGCGAGCCTCCGCCGCCGAG CCTCGTTCGTGTCC 
CCGCCCCTCGCTCCTGCAGCTACTGCTCAGAAACGCTGGGGCGCCCACCCTGGGAGACTAACGAAGCAGCTC CCTTCCCACCCC^ 
CTGCAGGTCTAATTTTGGACGCTTTGCCTGCCATTTCTTCCAGGTTGAGGGAGCCGCAGAGGCGGAGGCTCGCGTATTCCTGCAGT 

15 CAGCACCCACGTCGCCCCCGGACGCTCGGTGCTCAGGCCCTTCGCGAGCGGGGCTCTCCGTCTGCGGTCCCTTGTG AAGGCTCTGG 
GCGGCTGCAGAGGGCGGCCGTCCGGTTTGGCTCACCTCTCCCAGGAAACTTCACACTGGAGAGCCAAAAGGA GTGGAAGAGCCTGT 
CTTGGAGATTTTCCTGGGGAAATCCTGAGGTCATTCATT ATQAAGTGTACCGCGCGGGAGTGGCTCAGAGTAACCACAGTGCTGTT 
CATGGCTAGAGCAATTCCAGCCATGGTGGTTCCCAATGCCACTTTATTGGAGAAACTTTTC 
AGTGGTGGATAGCCAAACAACGAGGGAAAAGGGCCATC^ 

20 AGT CAGGTGTAT C CAACAGC C T CTAATATGGAGTATATGACATGGGATGTAGAGCTGGAAAGAT CTGC AGAAT C CAGGG CTGAAAT 
TGCTTGTGGGAACATGGACCTGCAAGCTTGCTTCCATCAATTGGACAGAATTTGGGAGCACACTGGGGAAGATATAGGCCCCCGAC 
GTTTCATGTACAATCGTGGTATGATGAAGTGAAAGACTTTAGCTACCCATATGAACATGAATGCAACCCA 
GTTCTGGCCCTGTATGTACACATTATACA<^GGTCGTGTGG^ 

ATGAACATCTGGGGGCAGATATGGCCCAAAGCTGTCTACCTGGTGTGCAATTACTCCCCAAAGGGAAACTGGTGGGGCCATGCCCC 
25 TTACAAACATGGGCGGCCCTGTTCTGCTTGCCCACCTAGT^^^ 

ACAGGTAT TAT C C CC CTCGAGAAGAGGAAACAAATGAAATAGAACGG CAG CAGT C ACAAGT C CATGACAC C CATGTCCGGACAAGA 
TCAGATGATAGTAGCAGAAATGAAGTGATTAGCTTTGGGAAAAGTAATGAAAATAT^ 

GCTATATTTTCTTAGCAGTTATTTCTACAGTTAATTACATAGTCATGATTGTTCTACGTTTCATATATTATATGGTGCTTTGTATA 
TGCCCCTAATAAAATGAATCTAAACATTGAAAAAAA 


The FCTR7 protein encoded by SEQ ID NO:24 has 300 amino acid residues and is
presented using the one-letter code in Table 7B. The FCTR7 gene was found to be expressed in:
brain; germ cell tumors. The FCTR7 gene maps to Unigene cluster Hs.182364, which is expressed in
the following tissues: brain, breast, ear, germ cell, heart, liver, lung, whole embryo, ovary,
pancreas, pooled, prostate, stomach, testis, uterus, vascular. Therefore the FCTR7 protein
described in this invention is also expressed in the above tissues.

The SignalP, Psort and/or Hydropathy profile for FCTR7 predict that this sequence has a
signal peptide and is likely to be localized outside of the cell with a certainty of 0.4228. The
SignalP shows a cleavage site between amino acids 20 and 21, i.e., at the dash in the
amino acid sequence ARA-IP. The predicted molecular weight of FCTR7 is 34739.9 Daltons. The
hydropathy profile shows an amino terminal hydrophobic region. This region could function as a
signal peptide and target the protein of the invention for secretion or plasma membrane localization.

Table 7B. Encoded FCTR7 protein sequence (SEQ ID NO:25).

MKCTAREWLROTTVLFMARAIPAMWPNATLLEKLLEKYMDEDGEW 

TWDVELERSAESRAESCLWEHGPASLLPSIGQNLGAHWGRYRPPTFHVQSWYDEVKDFSYPYEHECNPYCPFRCSGPVCTHYTQW 
WATSNRIGCAINLCHNMNIWGQIWPKAWLVCNYSPKGTO^ 
5 IERQQSQVHDTHWTRSDDSSRNEVISFGKSNENIMVLEILC 

This gene maps to Unigene cluster Hs.182364, which has been assigned the
mapping information shown in Table 7C. Therefore the chromosomal assignment for this gene is
the same as that for Unigene cluster Hs.182364.

Table 7C. Mapping Information.

Chromosome: 8 

Gene Map 98: Marker SHGC-32056 , Interval D8S279-D8S526 

Gene Map 98: Marker SGC32056 , Interval D8S526-D8S275 

Gene Map 98: Marker sts-G20223 , Interval D8S526-D8S275 

Gene Map 98: Marker stSG30385 , Interval D8S526-D8S275 

Whitehead map: EST67946, Chr.8 

dbSTS entries: G25853, G29349, G20223 

The predicted amino acid sequence was searched in the publicly available GenBank
database. FCTR7 protein showed Score = 743 (261.5 bits), Expect = 1.4e-73, P = 1.4e-73, 54%
identities (129 over 237 amino acids) and 70% homologies (167 over 237 amino acids) with
human 25 kD trypsin inhibitor protein (258 aa; ACC:043692) (Table 7D).
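
The percentages reported alongside BLAST hits are simple ratios of matching positions to aligned length. The short sketch below recomputes them from the counts quoted for the 25 kD trypsin inhibitor hit; the helper name `percent` is hypothetical, and the exact rounding convention of the search program is not assumed (the reported figures are whole numbers, while the sketch prints one decimal place).

```python
def percent(matches: int, aligned_length: int) -> float:
    """Return matches as a percentage of the aligned length."""
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    return 100.0 * matches / aligned_length


# Counts quoted for the alignment with the human 25 kD trypsin inhibitor.
identities = percent(129, 237)
positives = percent(167, 237)
print(f"{identities:.1f}% identities, {positives:.1f}% positives")
```

The same helper applies to any of the "x of y residues" statements in this section, for nucleotide and amino acid alignments alike.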

Table 7D. BLASTX search results are shown below:

ptnr:SPTREMBL-ACC:043692 25 KDA TRYPSIN INHIBITOR - HO... +2 743 8.4e-73 1 (SEQ ID NO:88)
ptnr:SPTREMBL-ACC:044228 HRTT-1 - HALOCYNTHIA RORETZI ... +2 325 2.9e-28 1 (SEQ ID NO:89)
ptnr:SWISSPROT-ACC:P48060 GLIOMA PATHOGENESIS-RELATED ... +2 314 5.3e-27 1 (SEQ ID NO:90)
ptnr:PIR-ID:JC4131 glioma pathogenesis-related protein... +2 309 2.0e-26 1 (SEQ ID NO:91)
ptnr:SWISSNEW-ACC:019010 CYSTEINE-RICH SECRETORY PROTE... +2 258 9.4e-21 1 (SEQ ID NO:92)






The nucleotide sequence of FCTR7 has 954 of 957 residues (99%) identical to the 1-957
base segment, and 174 of 175 residues (99%) identical to bases 1779-1953, of the 2664
nucleotide Homo sapiens putative secretory protein precursor, mRNA (GenBank-ACC:
AF142573) (SEQ ID NO:93) (Table 7E).

Table 7E. BLASTN of FCTR7 against Putative secretory protein precursor (SEQ ID NO:93)

>gi|12002310|gb|AF142573.1|AF142573 Homo sapiens putative secretory protein
precursor, mRNA, complete cds
Length = 2664

Score = 1865 bits (941), Expect = 0.0
Identities = 954/957 (99%), Gaps = 1/957 (0%)
Strand = Plus / Plus

Query: 364  gtccggtttggctcacctctcccaggaaacttcacactggagagccaaaaggagtggaag 423
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 1    gtccggtttggctcacctctcccaggaaacttcacactggagagccaaaaggagtggaag 60

Query: 424  agcctgtcttggagattttcctggggaaatcctgaggtcattcattatgaagtgtaccgc 483
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 61   agcctgtcttggagattttcctggggaaatcctgaggtcattcattatgaagtgtaccgc 120

Query: 484  gcgggagtggctcagagtaaccacagtgctgttcatggctagagcaattccagccatggt 543
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 121  gcgggagtggctcagagtaaccacagtgctgttcatggctagagcaattccagccatggt 180

Query: 544  ggttcccaatgccactttattggagaaacttttggaaaaatacatggatgaggatggtga 603
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 181  ggttcccaatgccactttattggagaaacttttggaaaaatacatggatgaggatggtga 240

Query: 604  gtggtggatagccaaacaacgagggaaaagggccatcacagacaatgacatgcagagtat 663
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 241  gtggtggatagccaaacaacgagggaaaagggccatcacagacaatgacatgcagagtat 300

Query: 664  tttggaccttcataataaattacgaagtcaggtgtatccaacagcctctaatatggagta 723
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 301  tttggaccttcataataaattacgaagtcaggtgtatccaacagcctctaatatggagta 360

Query: 724  tatgacatgggatgtagagctggaaagatctgcagaatccagggctgaaa-ttgcttgtg 782
            |||||||||||||||||||||||||||||||||||||||| ||||||||| |||||||||
Sbjct: 361  tatgacatgggatgtagagctggaaagatctgcagaatcctgggctgaaagttgcttgtg 420

Query: 783  ggaacatggacctgcaagcttgcttccatcaattggacagaatttgggagcacactgggg 842
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 421  ggaacatggacctgcaagcttgcttccatcaattggacagaatttgggagcacactgggg 480

Query: 843  aagatataggcccccgacgtttcatgtacaatcgtggtatgatgaagtgaaagactttag 902
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 481  aagatataggcccccgacgtttcatgtacaatcgtggtatgatgaagtgaaagactttag 540

Query: 903  ctacccatatgaacatgaatgcaacccatattgtccattcaggtgttctggccctgtatg 962
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 541  ctacccatatgaacatgaatgcaacccatattgtccattcaggtgttctggccctgtatg 600

Query: 963  tacacattatacacaggtcgtgtgggcaactagtaacagaatcggttgtgccattaattt 1022
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 601  tacacattatacacaggtcgtgtgggcaactagtaacagaatcggttgtgccattaattt 660

Query: 1023 gtgtcataacatgaacatctgggggcagatatggcccaaagctgtctacctggtgtgcaa 1082
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 661  gtgtcataacatgaacatctgggggcagatatggcccaaagctgtctacctggtgtgcaa 720

Query: 1083 ttactccccaaagggaaactggtggggccatgccccttacaaacatgggcggccctgttc 1142
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 721  ttactccccaaagggaaactggtggggccatgccccttacaaacatgggcggccctgttc 780

Query: 1143 tgcttgcccacctagttttggagggggctgtagagaaaatctgtgctacaaagaagggtc 1202
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 781  tgcttgcccacctagttttggagggggctgtagagaaaatctgtgctacaaagaagggtc 840

Query: 1203 agacaggtattatccccctcgagaagaggaaacaaatgaaatagaacggcagcagtcaca 1262
            |||||||||||||||||||||||||||||||||||||||||||||||| |||||||||||
Sbjct: 841  agacaggtattatccccctcgagaagaggaaacaaatgaaatagaacgacagcagtcaca 900

Query: 1263 agtccatgacacccatgtccggacaagatcagatgatagtagcagaaatgaagtcat 1319
            |||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 901  agtccatgacacccatgtccggacaagatcagatgatagtagcagaaatgaagtcat 957

Score = 339 bits (171), Expect = 3e-90
Identities = 174/175 (99%)
Strand = Plus / Plus

Query: 1317 cattagctttgggaaaagtaatgaaaatataatggttttagaaatcctgtgttaaatatt 1376
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 1779 cattagctttgggaaaagtaatgaaaatataatggttttagaaatcctgtgttaaatatt 1838

Query: 1377 gctatattttcttagcagttatttctacagttaattacatagtcatgattgttctacgtt 1436
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 1839 gctatattttcttagcagttatttctacagttaattacatagtcatgattgttctacgtt 1898

Query: 1437 tcatatattatatggtgctttgtatatgcccctaataaaatgaatctaaacattg 1491
            |||||||||||||||||||||||||||||| ||||||||||||||||||||||||
Sbjct: 1899 tcatatattatatggtgctttgtatatgccactaataaaatgaatctaaacattg 1953

The FCTR7 amino acid has 284 of 285 amino acid residues (99%) identical to, and 284 
of 285 amino acid residues (99%) similar to, the 500 amino acid Putative secretory protein 
precursor [Homo sapiens] (GenBank-Acc No.: AF142573) (SEQ ID NO:94) (Table 7F). 

Table 7F. BLASTP alignments of FCTR7 against Putative secretory protein precursor,
(SEQ ID NO:94)

>gi|12002311|gb|AAG43287.1|AF142573_1 (AF142573) putative secretory protein
precursor [Homo sapiens]
Length = 500

Score = 581 bits (1499), Expect = e-165
Identities = 284/285 (99%), Positives = 284/285 (99%)

Query: 1    MKCTAREWLRVTTVLFMARAIPAMVVPNATLLEKLLEKYMDEDGEWWIAKQRGKRAITDN 60
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 1    MKCTAREWLRVTTVLFMARAIPAMVVPNATLLEKLLEKYMDEDGEWWIAKQRGKRAITDN 60

Query: 61   DMQSILDLHNKLRSQVYPTASNMEYMTWDVELERSAESRAESCLWEHGPASLLPSIGQNL 120
            |||||||||||||||||||||||||||||||||||||| |||||||||||||||||||||
Sbjct: 61   DMQSILDLHNKLRSQVYPTASNMEYMTWDVELERSAESWAESCLWEHGPASLLPSIGQNL 120

Query: 121  GAHWGRYRPPTFHVQSWYDEVKDFSYPYEHECNPYCPFRCSGPVCTHYTQVVWATSNRIG 180
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 121  GAHWGRYRPPTFHVQSWYDEVKDFSYPYEHECNPYCPFRCSGPVCTHYTQVVWATSNRIG 180

Query: 181  CAINLCHNMNIWGQIWPKAVYLVCNYSPKGNWWGHAPYKHGRPCSACPPSFGGGCRENLC 240
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 181  CAINLCHNMNIWGQIWPKAVYLVCNYSPKGNWWGHAPYKHGRPCSACPPSFGGGCRENLC 240

Query: 241  YKEGSDRYYPPREEETNEIERQQSQVHDTHVRTRSDDSSRNEVIS 285
            |||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 241  YKEGSDRYYPPREEETNEIERQQSQVHDTHVRTRSDDSSRNEVIS 285

The FCTR7 amino acid has 137 of 176 amino acid residues (78%) identical to, and 151
of 176 amino acid residues (86%) similar to, the 188 amino acid Late gestation lung protein 1
[Rattus norvegicus] (GenBank-Acc No.: AF109674) (SEQ ID NO:95) (Table 7G).

Table 7G. BLASTP alignments of FCTR7 against Late gestation lung protein 1, (SEQ ID
NO:95)

>gi|4324682|gb|AAD16986.1| (AF109674) late gestation lung protein 1 [Rattus
norvegicus]
Length = 188

Score = 277 bits (709), Expect = 1e-73
Identities = 137/176 (78%), Positives = 151/176 (86%)

20 Query 68 LHNKLRSQVYPTASNMEYMTWDVELERSAESRAESCLWEHGPASLLPSIGQNLGAHWGRY 12 7 

1 1 1 1 1 1 III! IMIIIMM HUM I MIMMIMI IIIMI Mill 

Sbjct: 2 LHNKLRGQWPPASNMEYMTWDEELERS AAAWAQRCLWEHGPAS LLVS IGQNLAVHWGRY 61 
Query: 128 RPPTFHVQSWYDEVKDFSYPYEHECNPYCPFRCSGPVCTHYTQWWATSNRIGCAINLCH 187 

25 I I IMIMMIIII++MI IMM + II 1 1 1 1 +IIIIII + IIII + I + IIII++ i 

Sbjct: 62 RSPGFHVQSWYDEVKDYTYPYPHECNPWCPERCSGAMCTHYTQMVWATTNKIGCAVHTCR 121 
Query* 188 NIVnSTIWGQIWPKAVYLVCNYSPKGNWWGHAPYKHGRPCSACPPSFGGGCRENLCYKE 243 

+ I++II II llililliliilli I IliiMMM II MINI llll + l 

30 Sbjct: 122 SMSVWGDIWENAVYLVCNYSPKGNWIGEAPYKHGRPCSECPSSYGGGCRNNLCYRE 177 

The FCTR7 amino acid has 130 of 237 amino acid residues (55%) identical to, and 165
of 237 amino acid residues (70%) similar to, the 258 amino acid R3H domain-containing
preproprotein; 25 kDa trypsin inhibitor [Homo sapiens] (GenBank-Acc No.: D45027) (SEQ ID
NO:96) (Table 7H).

Table 7H. BLASTP alignments of FCTR7 against R3H domain-containing preproprotein,
25 kDa trypsin inhibitor (SEQ ID NO:96)

>gi|7705676|ref|NP_056970.1| R3H domain-containing preproprotein; 25 kDa
trypsin inhibitor; R3H domain (binds single-stranded nucleic acids) containing
[Homo sapiens]
gi|2943716|dbj|BAA25066.1| (D45027) 25 kDa trypsin inhibitor [Homo sapiens]
Length = 258

Score = 265 bits (678), Expect = 4e-70
Identities = 130/237 (55%), Positives = 165/237 (70%), Gaps = 3/237 (1%)



Query: 12 TTVLFMARAI PAMWPNATLLEKLLEKYMDEDGEWWIAKQRGKRAITDNDMQS ILDLHNK 71 

50 +| |+ + + | | +| |+ +| I ! I I I 1+ I I I +1 I I I 1 + 

Sbjct: 2 0 STWLLNS TDS S PPTNNFTD IEAALKAQLDS AD 1 PKARRKRYISQNDMI AILDYHNQ 76 






Query 72 LRSQVYPTASNMEYMTWDVELERS AESRAES CLWEHGPASLLPS IGQNLGAHWGRYRPPT 131 

+ | +| + | l + lllll II I III I I + I + IM + II + 1111 Mil 
Sbjct: 77 VRGKVFPPAANMEYMVWDENLAKSAEAWAATCIWDHGPSYLLRFLGQNLSVRTGRYRSIL 136 

5 Query 132 FHVQSWYDEVKDFSYPYEHECNPYCPFRCSGPVCTHYTQWWATSNRIGCAINLCHNMNI 191 

|+ IIMIII+++II +111 II II II + IIIIII + IIIIIMIIIM + I MI + 

Sbjct: 137 QLVKPWYDEVKDYAFPYPQDCNPRCPMRCFGPMCTHYTQMVWATSNRIGCAIHTCQNMNV 196 

Query 192 WGQIWPKAWLVCNYSPKGNWWGHAPYKHGRPCSACPPSFGGGCRENLCYKEGSDRY 248 
10 || +| + MMMM + IMM I I II I I lll+llll+ll I +111+ + I 

Sbjct: 197 WGSVWRRAVYLVCNYAPKGNWIGEAPYKVGVPCSSCPPSYGGSCTDNLCFPGVTSNY 253 

The FCTR7 amino acid has 109 of 233 amino acid residues (47%) identical to, and 146
of 233 amino acid residues (63%) similar to, the 253 amino acid Novel protein similar to a
trypsin inhibitor [Homo sapiens] (EMBL Acc No.: AL117382) (SEQ ID NO:97) (Table 7I).

Table 7I. BLASTP alignments of FCTR7 against Novel protein similar to a trypsin
inhibitor, (SEQ ID NO:97)

>gi|9885193|emb|CAC04190.1| (AL117382) dJ881L22.3 (novel protein similar to a
trypsin inhibitor) [Homo sapiens]
Length = 253

Score = 225 bits (575), Expect = 4e-58
Identities = 109/233 (47%), Positives = 146/233 (63%), Gaps = 8/233 (3%)

Query: 10 RVTWLFMARAIPAMWPNATLLEKLLEKYMDEDGEWWIAKQRGKRAITDNDMQSILDLH 69 

30 + M II I I +1 + I + + I I I 1+ I I ++I M 

Sbjct: 19 QAVNALIMPNATPAPAQPESTAMRLL SGLEVPRYRRKRHI SVRDMNALLDYH 70 

Query 70 NKLRSQVYPTASNMEYMTWDVELERSAESRAESCLWEHGPASLLPSIGQNLGAHWGRYRP 129 

| +1+ Ml l + lllll II MM I l + l MI+ 1+ +MM I l + l I 
35 Sbjct: 71 NHIRASVYPPAANMEYMWDKRLARAAEAWATQCIWAHGPSQIjMRYVGQNIjSIHSGQYRS 130 

Query 13 0 PTFHVQSWYDEVKDFSYPYEHECNPYCPFRCSGPVCTHYTQWWATSNRIGCAINLCHNM 189 

++I l+l + +1 +1 ll+l l+l I II l+l 1 1 l+l I l+l I l+l 1 1 1+ I ++ 

Sbjct: 131 WDLMKSWSEEKWHYLFPAPRDCHPHCPWRCDGPTCSHYTQMVWASSNRLGCAIHTCSSI 190 



Query: 190 NIWGQIWPKAVYLVCNYSPKGNWWGHAPYKHGRPCSACPPSFGGGCRENLCYK 242

++M I +1 II II 11+ MM I +111 I + IM + IIII+ I I l + l + l 

Sbjct: 191 SVWGNTWHRAAYLVCNYAIKGNWIGESPYKMGKPCSSCPPSYQGSCNSNMCFK 243 



The FCTR7 amino acid has 129 of 237 amino acid residues (54%) identical to, and 167
of 237 amino acid residues (70%) similar to, the 258 amino acid 25 kDa Trypsin Inhibitor from
Homo sapiens (EMBL Acc No.: 043692) (SEQ ID NO:88) (Table 7J).

Table 7J. BLASTP alignments of FCTR7 against 25 kDa Trypsin Inhibitor, (SEQ ID
NO:88)

ptnr:SPTREMBL-ACC:043692 25 KDA TRYPSIN INHIBITOR - Homo sapiens (Human), 258 aa.

Score = 743 (261.5 bits), Expect = 1.6e-73, P = 1.6e-73
Identities = 129/237 (54%), Positives = 167/237 (70%)

5 

The FCTR7 amino acid has 79 of 193 amino acid residues (40%) identical to, and 110 of
193 amino acid residues (56%) similar to, the 266 amino acid Glioma Pathogenesis-Related
Protein (RTVP-1 Protein) - Homo sapiens (SWISSPROT Acc No.: P48060) (SEQ ID NO:90)
(Table 7K).

Table 7K. BLASTP alignments of FCTR7 against Glioma Pathogenesis-Related Protein,
(SEQ ID NO:90)

ptnr:SWISSPROT-ACC:P48060 GLIOMA PATHOGENESIS-RELATED PROTEIN (RTVP-1 PROTEIN)
- Homo sapiens (Human), 266 aa

Score = 314 (110.5 bits), Expect = 4.7e-28, P = 4.7e-28
Identities = 79/193 (40%), Positives = 110/193 (56%)

The FCTR7 amino acid has 66 of 186 amino acid residues (35%) identical to, and 91 of
186 amino acid residues (48%) similar to, the 186 amino acid Neutrophil granules matrix
glycoprotein SGP28 precursor from Homo sapiens (SWISSPROT Acc No.: S68691) (SEQ ID
NO:98) (Table 7L).

Table 7L. BLASTP alignments of FCTR7 against Neutrophil granules matrix glycoprotein,
(SEQ ID NO:98)

ptnr:PIR-ID:S68691 neutrophil granules matrix glycoprotein SGP28 precursor -
human

Score = 254 (89.4 bits), Expect = 1.1e-21, P = 1.1e-21
Identities = 66/186 (35%), Positives = 91/186 (48%)

A novel developmentally regulated gene with homology to a tumor derived trypsin
inhibitor is expressed in lung mesenchyme, as described in Am. J. Physiol. 0:0-0(1999). cDNA
cloning of a novel trypsin inhibitor with similarity to pathogenesis-related proteins, and its
frequent expression in human brain cancer cells, is disclosed in Biochim. Biophys. Acta
1395:202-208(1998). RTVP-1, a novel human gene with sequence similarity to genes of diverse
species, is expressed in tumor cell lines of glial but not neuronal origin, as published in Gene
180:125-130(1996). The human glioma pathogenesis-related protein is structurally related to
plant pathogenesis-related proteins and its gene is expressed specifically in brain tumors (Gene
159:131-135(1995)). Structure comparison of human glioma pathogenesis-related protein GliPR
and the plant pathogenesis-related protein P14a indicates a functional link between the human
immune system and a plant defense system (Proc. Natl. Acad. Sci. U.S.A. 95:2262-2266(1998)).
GliPR is highly expressed in the human brain tumor glioblastoma multiforme/astrocytoma, but
not in normal fetal or adult brain tissue, nor in other nervous system tumors. GliPR belongs
to a family that groups mammalian SCP/TPX1, insect AG3/AG5, fungal SC7/SC14 and plant
PR-1 proteins. SGP28, a novel matrix glycoprotein in specific granules of human neutrophils with
similarity to a human testis-specific gene product and to a rodent sperm-coating glycoprotein,
is described in FEBS Lett. 380, 246-250, 1996. The primary structure and properties of
helothermine, a peptide toxin that blocks ryanodine receptors, are described in Biophys. J.
68:2280-2288(1995). Like GliPR, helothermine belongs to the family that groups mammalian
SCP/TPX1, insect AG3/AG5, fungal SC7/SC14 and plant PR-1 proteins.

Based upon homology, FCTR7 protein and each homologous protein or peptide may 
share at least some activity. 

Therapeutic uses: 

FCTR7 protein has homology to trypsin inhibitors, Q91055 helothermine, tumor derived
trypsin inhibitors, glioma pathogenesis-related protein, Q9Z0U6 late gestation lung
protein 1, and to the Prosite family which groups mammalian SCP/TPX1, insect
AG3/AG5, fungal SC7/SC14 and plant PR-1 proteins. Therefore the FCTR7 protein
disclosed in this invention could function like the proteins to which it has homology. These
functions include tissue development in vitro and in vivo, and cancer pathogenesis.

Based on the tissue expression pattern, the gene is implicated in diseases of tissues in which
it is expressed. These diseases include but are not limited to:

• Glioma, 

• cancer, 

• lung diseases, 

• gestation, 

• male and female reproductive diseases, 

• deafness, 

• neurological disorders, 

• gastric disorders, and 

• pancreatic diseases like diabetes. 



These materials are further useful in the generation of antibodies that bind
immunospecifically to the novel FCTR7 substances for use in therapeutic or diagnostic methods.
These antibodies may be generated according to methods known in the art, using prediction from
hydrophobicity charts, as described in the "Anti-FCTRX Antibodies" section below. In one
embodiment, a contemplated FCTR7 epitope is from aa 40 to 120. In another embodiment, a
FCTR7 epitope is from aa 130 to 170. In additional embodiments, FCTR7 epitopes are from aa
210 to 230, and from aa 240 to 280.



TABLE 8A: Summary Of Nucleic Acids And Proteins Of The Invention

Name | Tables | Clone; Description of Homolog | Nucleic Acid SEQ ID NO | Amino Acid SEQ ID NO
FCTR1 | 1A, 1B | [clone ID illegible]; follistatin-like protein | 1 | 2
FCTR2 | 2A, 2B | AC012614 1.0.123; KIAA1061-like protein | 3 | 4
FCTR3 | 3A, 3B | 10129612.0.118; neurestin-like protein | 5 | 6
 | 3C, 3D | 10129612.0.405; neurestin-like protein | 7 | 8
 | 3E | 10129612.0.154; neurestin-like protein | 9 |
 | 3F | 10129612.0.67; neurestin-like protein | 10 |
 | 3G | 10129612.0.258; neurestin-like protein | 11 |
 | 3H, 3I | 10129612.0.352; neurestin-like protein | 12 | 13
FCTR4 | 4A, 4B | 29692275.0.1; NF-kappa-B P65 delta 3-like protein | 14 | 15
FCTR5 | 5A, 5B | 32125243.0.21; human complement C1R component precursor-like protein | 16 | 17
 | 5C, 5D | | 18 | 19
FCTR6 | 6A, 6B | [clone ID illegible]; novel human blood coagulation factor XI-like protein | 20 | 21
 | 6C, 6D | [clone ID illegible]; novel human blood coagulation factor XI-like protein | 22 | 23
FCTR7 | 7A, 7B | 32592466.0.64; trypsin inhibitor-like protein | 24 | 25
FCTR1 | Example 2 | Ag809 Forward | 26 |
FCTR1 | Example 2 | Ag809 Probe | 27 |
FCTR1 | Example 2 | Ag809 Reverse | 28 |
FCTR4 | Example 2 | Ag2773 Forward | 29 |
FCTR4 | Example 2 | Ag2773 Probe | 30 |
FCTR4 | Example 2 | Ag2773 Reverse | 31 |
FCTR5 | Example 2 | Ag427 Forward | 32 |
FCTR5 | Example 2 | Ag427 Probe | 33 |
FCTR5 | Example 2 | Ag427 Reverse | 34 |
FCTR6 | Example 2 | Ag1541 Forward | 35 |
FCTR6 | Example 2 | Ag1541 Probe | 36 |
FCTR6 | Example 2 | Ag1541 Reverse | 37 |


TABLE 8B: Summary of Query Sequences Disclosed

Table | Database | Acc. No. | Sequence Name | Species | SEQ ID NO.
1C, 1K | RemTrEmbl | BAA21725 | IGFBP-like protein | Mouse | 38
1D | SptrEmbl | Q61581 | Follistatin-like protein-2 | Mouse | 39
1E | SptrEmbl | 307822 | Mac25 protein | Human | 40
1F, 1K | SptrEmbl | 388812 | Mac25 protein | Mouse | 41
1G, 1K | SptrEmbl | 316270 | Prostacyclin-stimulating factor | Human | 42
1H, 1K | PIR | B40098 | Colorectal cancer suppressor | Rat | 43
1I | TrEmblnew | AAD9360 | PTP sigma (brain) precursor | Human | 44
1J | SptrEmbl | Q13332 | PTP sigma precursor | Human | 45
2C | GenBank | AB028984 | KIAA1061 cDNA | Human | 46
2D | TrEmblnew | BAA85677 | KIAA1263 | Human | 47
2E | TrEmblnew | BAA83013 | KIAA1061 protein fragment | Human | 48
2F | Embl | CAB70877.1 | Hypothetical protein DKFzp566D234.1 | Human | 49
2G | GenBank | Q62632 | Follistatin-related protein-1 precursor | Rat | 50
2H | GenBank | Q62536 | Follistatin-related protein-1 precursor | Mouse | 51
2I | GenBank | JG0187 | Follistatin related protein | African clawed frog | 52
2J | GenBank | Q12841 | Follistatin related protein-1 precursor | Human | 53
2K | Embl | CAB42968.1 | Flik protein | Chicken | 54
2L | GenBank | T13822 | Frazzled gene protein | Fruit fly | 55
2M | GenBank | AAC38849.1 | Roundabout 1 | Fruit fly | 56
2N | GenBank | 060469 | Down Syndrome Cell Adhesion Molecule Precursor | Human | 57
2O | SwissProt | Q13449 | Limbic system-associated membrane protein precursor | Human | 58
2P | SptrEmbl | 070246 | Putative neuronal cell adhesion molecule, short form | Mouse | 59
2Q | SptrEmbl | 002869 | CHLAMP, G11-isoform precursor | Chicken | 60
2R | SwissProt | Q62813 | Limbic system-associated membrane protein precursor | Rat | 61
3J | GenBank | NM_011856.2 | Odd Oz/ten-m homology 2 | Fruit fly | 62
3K | Embl | AJ245711.1 | Teneurin-2 cDNA, short splice variant | Chicken | 63
3L | GenBank | AB032953 | KIAA1127 cDNA | Human | 64
3M, 3U | GenBank | AB025411 | Ten-m2 cDNA | Mouse | 65
3N | GenBank | NM_020088.1 | Neurestin alpha cDNA | Rat | 66
3O | Embl | GGA278031 | Teneurin-2 | Chicken | 67
3P | GenBank | NP_035986.2 | Odd Oz/ten-m homology 2 | Fruit fly | 68
3Q | Embl | CAC09416.1 | Teneurin-2 | Chicken | 69
3R | GenBank | BAA77399.1 | Ten-m4 | Mouse | 70
3S | GenBank | AB032953 | KIAA1127 protein | Human | 71
3T | GenBank | AF086607 | Neurestin alpha | Rat | 72
4C | SptrEmbl | Q99233 | Hypothetical 10 kD protein | Trypanosome | 73
4C | SptrEmbl | Q16896 | GABA receptor subunit | | 74
4C | SptrEmbl | 076473 | GABA receptor subunit | | 75
4C | TrEmblnew | AAD28317 | F13J11.13 protein | | 76
Text p. 90 | SptrEmbl | Q13313 | NF-kappa B P65 delta 3 protein | Human | 77
5E | GenBank | XM_007061.1 | Complement C1R-like proteinase precursor | Human | 78
5F | GenBank | NM_001733.1 | Complement component 1, R subcomponent cDNA | Human | 79
5G | GenBank | AAF44349.1 | Complement C1R-like proteinase precursor | Human | 80
5H | GenBank | AAA5185.1 | Complement C1R component precursor | Human | 81
6E | GenBank | AB046651 | Brain cDNA clone Qcc-17034 | Macaque | 82
6F | GenBank | AK09660 | Adult testis cDNA, RIKEN full length enriched | Mouse | 83
6G | GenBank | AB046651 | Hypothetical protein | Macaque | 84
6H | GenBank | NP_000838.1 | Plasma kallikrein B1 precursor | Human | 85
6I | GenBank | BAA37147.1 | Kallikrein | Pig | 86
6J | Embl | CAA64368.1 | Coagulation factor XI | Human | 87
7D, 7J | SptrEmbl | 043692 | 25 kDa trypsin inhibitor | Human | 88
7D | SptrEmbl | 044228 | HRTT-1 | | 89
7D, 7K | SptrEmbl | P48060 | Glioma pathogenesis-related protein | Human | 90
7D | PIR-ID | JC4131 | Glioma pathogenesis-related protein | Human | 91
7D | SwissProt | 019010 | Cysteine-rich secretory protein | | 92
7E | GenBank | AF142573 | Putative secretory protein precursor cDNA | Human | 93
7F | GenBank | AF142573 | Putative secretory protein precursor | Human | 94
7G | GenBank | AF109674 | Late gestation lung protein 1 | Rat | 95
7H | GenBank | D45027 | R3H domain containing preproprotein, 25 kDa trypsin inhibitor | Human | 96
7I | Embl | AL117382 | Novel protein similar to a trypsin inhibitor | Human | 97
7L | PIR-ID | S68691 | Neutrophil granules matrix glycoprotein SGP28 precursor | Human | 98



FCTRX Nucleic Acids and Polypeptides 

One aspect of the invention pertains to isolated nucleic acid molecules that encode
FCTRX polypeptides or biologically-active portions thereof. Also included in the invention are
nucleic acid fragments sufficient for use as hybridization probes to identify FCTRX-encoding
nucleic acids (e.g., FCTRX mRNAs) and fragments for use as PCR primers for the amplification
and/or mutation of FCTRX nucleic acid molecules. As used herein, the term "nucleic acid
molecule" is intended to include DNA molecules (e.g., cDNA or genomic DNA), RNA
molecules (e.g., mRNA), analogs of the DNA or RNA generated using nucleotide analogs, and
derivatives, fragments and homologs thereof. The nucleic acid molecule may be single-stranded
or double-stranded, but preferably comprises double-stranded DNA.

An FCTRX nucleic acid can encode a mature FCTRX polypeptide. As used herein, a
"mature" form of a polypeptide or protein disclosed in the present invention is the product of a
naturally occurring polypeptide or precursor form or proprotein. The naturally occurring
polypeptide, precursor or proprotein includes, by way of nonlimiting example, the full length
gene product, encoded by the corresponding gene. Alternatively, it may be defined as the
polypeptide, precursor or proprotein encoded by an ORF described herein. The
"mature" form arises, again by way of nonlimiting example, as a result of one or more naturally
occurring processing steps as they may take place within the cell, or host cell, in which the gene
product arises. Examples of such processing steps leading to a "mature" form of a polypeptide
or protein include the cleavage of the N-terminal methionine residue encoded by the initiation
codon of an ORF, or the proteolytic cleavage of a signal peptide or leader sequence. Thus a
mature form arising from a precursor polypeptide or protein that has residues 1 to N, where
residue 1 is the N-terminal methionine, would have residues 2 through N remaining after
removal of the N-terminal methionine. Alternatively, a mature form arising from a precursor
polypeptide or protein having residues 1 to N, in which an N-terminal signal sequence from
residue 1 to residue M is cleaved, would have the residues from residue M+1 to residue N
remaining. Further as used herein, a "mature" form of a polypeptide or protein may arise from a
step of post-translational modification other than a proteolytic cleavage event. Such additional
processes include, by way of non-limiting example, glycosylation, myristoylation or
phosphorylation. In general, a mature polypeptide or protein may result from the operation of
only one of these processes, or a combination of any of them.
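
The two proteolytic routes to a "mature" form described above, removal of the N-terminal methionine (leaving residues 2 through N) and cleavage of a signal sequence spanning residues 1 to M (leaving residues M+1 through N), amount to simple 1-based slicing. The sketch below is illustrative only: the function names are hypothetical, and the 25-residue input is an assumed N-terminal fragment modeled on FCTR7, with M = 20 taken from the SignalP cleavage site reported for that protein.

```python
def remove_initiator_met(precursor: str) -> str:
    """Return residues 2..N after removal of the N-terminal methionine."""
    if not precursor.startswith("M"):
        raise ValueError("precursor does not begin with methionine")
    return precursor[1:]


def remove_signal_peptide(precursor: str, m: int) -> str:
    """Return residues M+1..N after cleavage of a signal sequence at residues 1..M."""
    if not 0 < m < len(precursor):
        raise ValueError("m must lie strictly inside the precursor")
    return precursor[m:]  # 0-based index m is 1-based residue M+1


# Assumed 25-residue N-terminal fragment; cleavage between residues 20 and 21.
precursor = "MKCTAREWLRVTTVLFMARAIPAMV"
print(remove_initiator_met(precursor))       # KCTAREWLRVTTVLFMARAIPAMV
print(remove_signal_peptide(precursor, 20))  # IPAMV
```

Non-proteolytic modifications such as glycosylation or phosphorylation change the residues chemically rather than the residue range, so they are not modeled by this slicing.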

The term "probes", as utilized herein, refers to nucleic acid sequences of variable length,
preferably between at least about 10 nucleotides (nt), 100 nt, or as many as approximately, e.g.,
6,000 nt, depending upon the specific use. Probes are used in the detection of identical, similar,
or complementary nucleic acid sequences. Longer length probes are generally obtained from a
natural or recombinant source, are highly specific, and are much slower to hybridize than
shorter-length oligomer probes. Probes may be single- or double-stranded and designed to have
specificity in PCR, membrane-based hybridization technologies, or ELISA-like technologies.

The term "isolated" nucleic acid molecule, as utilized herein, is one which is separated
from other nucleic acid molecules which are present in the natural source of the nucleic acid.
Preferably, an "isolated" nucleic acid is free of sequences which naturally flank the nucleic acid
(i.e., sequences located at the 5'- and 3'-termini of the nucleic acid) in the genomic DNA of the
organism from which the nucleic acid is derived. For example, in various embodiments, the
isolated FCTRX nucleic acid molecules can contain less than about 5 kb, 4 kb, 3 kb, 2 kb, 1 kb,
0.5 kb or 0.1 kb of nucleotide sequences which naturally flank the nucleic acid molecule in
genomic DNA of the cell/tissue from which the nucleic acid is derived (e.g., brain, heart, liver,
spleen, etc.). Moreover, an "isolated" nucleic acid molecule, such as a cDNA molecule, can be
substantially free of other cellular material or culture medium when produced by recombinant
techniques, or of chemical precursors or other chemicals when chemically synthesized.

A nucleic acid molecule of the invention, e.g., a nucleic acid molecule having the 
nucleotide sequence of SEQ ID NOS:1, 3, 5, 7, 9, 10, 11, 12, 14, 16, 18, 20, 22, and 24, or a 
complement of this aforementioned nucleotide sequence, can be isolated using standard 
molecular biology techniques and the sequence information provided herein. Using all or a 
portion of the nucleic acid sequence of SEQ ID NOS:1, 3, 5, 7, 9, 10, 11, 12, 14, 16, 18, 20, 22, 
and 24 as a hybridization probe, FCTRX molecules can be isolated using standard hybridization 
and cloning techniques (e.g., as described in Sambrook, et al., (eds.), Molecular Cloning: A 
Laboratory Manual, 2nd Ed., Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY, 
1989; and Ausubel, et al., (eds.), Current Protocols in Molecular Biology, John Wiley & 
Sons, New York, NY, 1993). 

A nucleic acid of the invention can be amplified using cDNA, mRNA or alternatively, 
genomic DNA, as a template and appropriate oligonucleotide primers according to standard PCR 
amplification techniques. The nucleic acid so amplified can be cloned into an appropriate vector 
and characterized by DNA sequence analysis. Furthermore, oligonucleotides corresponding to 
FCTRX nucleotide sequences can be prepared by standard synthetic techniques, e.g., using an 
automated DNA synthesizer. 

As used herein, the term "oligonucleotide" refers to a series of linked nucleotide residues, 
which oligonucleotide has a sufficient number of nucleotide bases to be used in a PCR reaction. 
A short oligonucleotide sequence may be based on, or designed from, a genomic or cDNA 
sequence and is used to amplify, confirm, or reveal the presence of an identical, similar or 
complementary DNA or RNA in a particular cell or tissue. Oligonucleotides comprise portions 
of a nucleic acid sequence having about 10 nt, 50 nt, or 100 nt in length, preferably about 15 nt 
to 30 nt in length. In one embodiment of the invention, an oligonucleotide comprising a nucleic 
acid molecule less than 100 nt in length would further comprise at least 6 contiguous nucleotides 
of SEQ ID NOS:1, 3, 5, 7, 9, 10, 11, 12, 14, 16, 18, 20, 22, and 24, or a complement thereof. 
Oligonucleotides may be chemically synthesized and may also be used as probes. 

In another embodiment, an isolated nucleic acid molecule of the invention comprises a 
nucleic acid molecule that is a complement of the nucleotide sequence shown in SEQ ID NOS:1, 
3, 5, 7, 9, 10, 11, 12, 14, 16, 18, 20, 22, and 24, or a portion of this nucleotide sequence (e.g., a 
fragment that can be used as a probe or primer or a fragment encoding a biologically-active 
portion of an FCTRX polypeptide). A nucleic acid molecule that is complementary to the 
nucleotide sequence shown in SEQ ID NOS:1, 3, 5, 7, 9, 10, 11, 12, 14, 16, 18, 20, 22, and 24, is 
one that is sufficiently complementary to the nucleotide sequence shown in SEQ ID NOS:1, 3, 5, 
7, 9, 10, 11, 12, 14, 16, 18, 20, 22, and 24, that it can hydrogen bond with few or no mismatches 
to the nucleotide sequence shown in SEQ ID NOS:1, 3, 5, 7, 9, 10, 11, 12, 14, 16, 18, 20, 22, and 
24, thereby forming a stable duplex. 
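
As a minimal sketch of the notion of a complement with few or no mismatches, the following Python computes the reverse complement of a DNA strand and counts the positions at which a candidate strand fails to pair with it. The function names and the short example sequences are hypothetical. Only antiparallel Watson-Crick pairing of A, C, G and T is modeled; duplex stability itself is not predicted.

```python
# Watson-Crick partners for DNA bases.
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the strand that pairs perfectly (antiparallel) with seq."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def count_mismatches(strand: str, candidate: str) -> int:
    """Count positions where candidate does not pair with strand.

    Both sequences are written 5' to 3' and must have equal length.
    """
    if len(strand) != len(candidate):
        raise ValueError("sequences must have the same length")
    perfect = reverse_complement(strand)
    return sum(a != b for a, b in zip(perfect, candidate.upper()))


# Hypothetical 4-nt example, for illustration only.
print(reverse_complement("ATGC"))        # GCAT
print(count_mismatches("ATGC", "GCAA"))  # 1
```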

As used herein, the term "complementary" refers to Watson-Crick or Hoogsteen base 
pairing between nucleotide units of a nucleic acid molecule, and the term "binding" means the 
physical or chemical interaction between two polypeptides or compounds or associated 
polypeptides or compounds or combinations thereof. Binding includes ionic, non-ionic, van der 
Waals, hydrophobic interactions, and the like. A physical interaction can be either direct or 
indirect. Indirect interactions may be through or due to the effects of another polypeptide or 
compound. Direct binding refers to interactions that do not take place through, or due to, the 
effect of another polypeptide or compound, but instead are without other substantial chemical 
intermediates. 

Fragments provided herein are defined as sequences of at least 6 (contiguous) nucleic 
acids or at least 4 (contiguous) amino acids, a length sufficient to allow for specific hybridization 
in the case of nucleic acids or for specific recognition of an epitope in the case of amino acids, 
respectively, and are at most some portion less than a full length sequence. Fragments may be 
derived from any contiguous portion of a nucleic acid or amino acid sequence of choice. 
Derivatives are nucleic acid sequences or amino acid sequences formed from the native 
compounds either directly or by modification or partial substitution. Analogs are nucleic acid 
sequences or amino acid sequences that have a structure similar to, but not identical to, the native 
compound but differ from it in respect to certain components or side chains. Analogs may be 
synthetic or from a different evolutionary origin and may have a similar or opposite metabolic 
activity compared to wild type. Homologs are nucleic acid sequences or amino acid sequences 
of a particular gene that are derived from different species. 

Derivatives and analogs may be full length or other than full length, if the derivative or 
analog contains a modified nucleic acid or amino acid, as described below. Derivatives or 
analogs of the nucleic acids or proteins of the invention include, but are not limited to, molecules 
comprising regions that are substantially homologous to the nucleic acids or proteins of the 
invention, in various embodiments, by at least about 70%, 80%, or 95% identity (with a 
preferred identity of 80-95%) over a nucleic acid or amino acid sequence of identical size or 
when compared to an aligned sequence in which the alignment is done by a computer homology 
program known in the art, or whose encoding nucleic acid is capable of hybridizing to the 
complement of a sequence encoding the aforementioned proteins under stringent, moderately 
stringent, or low stringent conditions. See, e.g., Ausubel, et al., Current Protocols in 
Molecular Biology, John Wiley & Sons, New York, NY, 1993, and below. 
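
The percent-identity thresholds recited above can be illustrated with a short Python sketch that scores two sequences that have already been aligned to identical length (gaps written as "-"). The function name and the example sequences are hypothetical. Taking the number of alignment columns as the denominator is an assumption; alignment programs differ on this convention, and the disclosure does not fix one.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent of alignment columns holding the same residue in both rows.

    Inputs must already be aligned (equal length, '-' for gaps).
    Columns that are gaps in both rows are ignored.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    columns = [
        (a, b)
        for a, b in zip(aligned_a.upper(), aligned_b.upper())
        if not (a == "-" and b == "-")
    ]
    if not columns:
        raise ValueError("no comparable columns")
    matches = sum(a == b and a != "-" for a, b in columns)
    return 100.0 * matches / len(columns)


# Hypothetical sequences: 8 of 10 positions agree.
identity = percent_identity("ACGTACGTAC", "ACGTACGTTT")
print(identity)          # 80.0
print(identity >= 70.0)  # True: meets the "at least about 70%" level
```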

A "homologous nucleic acid sequence" or "homologous amino acid sequence," or 
variations thereof, refer to sequences characterized by a homology at the nucleotide level or 
amino acid level as discussed above. Homologous nucleotide sequences include those sequences 
coding for isoforms of FCTRX polypeptides. Isoforms can be expressed in different tissues of 
the same organism as a result of, for example, alternative splicing of RNA. Alternatively, 
isoforms can be encoded by different genes. In the invention, homologous nucleotide sequences 
include nucleotide sequences encoding an FCTRX polypeptide of species other than humans, 
including, but not limited to, vertebrates, and thus can include, e.g., frog, mouse, rat, rabbit, dog, 
cat, cow, horse, and other organisms. Homologous nucleotide sequences also include, but are not 
limited to, naturally occurring allelic variations and mutations of the nucleotide sequences set 
forth herein. A homologous nucleotide sequence does not, however, include the exact nucleotide 
sequence encoding human FCTRX protein. Homologous nucleic acid sequences include those 
nucleic acid sequences that encode conservative amino acid substitutions (see below) in SEQ ID 
NOS:2, 4, 6, 8, 13, 15, 17, 19, 21, 23, and 25, as well as a polypeptide possessing FCTRX 
biological activity. Various biological activities of the FCTRX proteins are described below. 

An FCTRX polypeptide is encoded by the open reading frame ("ORF") of an FCTRX 
nucleic acid. An ORF corresponds to a nucleotide sequence that could potentially be translated 
into a polypeptide. A stretch of nucleic acids comprising an ORF is uninterrupted by a stop 
codon. An ORF that represents the coding sequence for a full protein begins with an ATG 
"start" codon and terminates with one of the three "stop" codons, namely, TAA, TAG, or TGA. 
For the purposes of this invention, an ORF may be any part of a coding sequence, with or 
without a start codon, a stop codon, or both. For an ORF to be considered as a good candidate 
for coding for a bona fide cellular protein, a minimum size requirement is often set, e.g., a stretch 
of DNA that would encode a protein of 50 amino acids or more. 
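
The definition of a full-protein ORF given above (an ATG start codon, no intervening stop codon, a terminating TAA, TAG or TGA, and a minimum size such as 50 amino acids) can be sketched as a simple scan. The function name `find_orfs`, the `min_codons` parameter, and the toy sequence are hypothetical. The sketch scans the three forward frames of one strand only and takes the first ATG in each open stretch; it is an illustration, not a gene-prediction method.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def find_orfs(dna: str, min_codons: int = 50) -> list[tuple[int, int]]:
    """Return (start, end) pairs of ATG...stop ORFs on the given strand.

    start is the 0-based index of the A of ATG; end is exclusive and
    includes the stop codon. min_codons counts the encoded amino acids
    (start codon included, stop codon excluded).
    """
    dna = dna.upper()
    orfs = []
    for frame in range(3):
        start = None
        for i in range(frame, len(dna) - 2, 3):
            codon = dna[i:i + 3]
            if start is None and codon == "ATG":
                start = i
            elif start is not None and codon in STOP_CODONS:
                if (i - start) // 3 >= min_codons:
                    orfs.append((start, i + 3))
                start = None
    return sorted(orfs)


# Toy sequence: ATG + 3 codons + TAA, far below the 50-residue cutoff.
toy = "CC" + "ATG" + "GCT" * 3 + "TAA" + "G"
print(find_orfs(toy, min_codons=4))  # [(2, 17)]
print(find_orfs(toy))                # []
```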

The nucleotide sequences determined from the cloning of the human FCTRX genes 
allow for the generation of probes and primers designed for use in identifying and/or cloning 
FCTRX homologues in other cell types, e.g. from other tissues, as well as FCTRX homologues 
from other vertebrates. The probe/primer typically comprises substantially purified 
oligonucleotide. The oligonucleotide typically comprises a region of nucleotide sequence that 
hybridizes under stringent conditions to at least about 12, 25, 50, 100, 150, 200, 250, 300, 350 or 
400 consecutive nucleotides of a sense strand nucleotide sequence of SEQ ID NOS:1, 3, 5, 7, 9, 
10, 11, 12, 14, 16, 18, 20, 22, and 24; or an anti-sense strand nucleotide sequence of SEQ ID 
NOS:1, 3, 5, 7, 9, 10, 11, 12, 14, 16, 18, 20, 22, and 24; or of a naturally occurring mutant of 
SEQ ID NOS:1, 3, 5, 7, 9, 10, 11, 12, 14, 16, 18, 20, 22, and 24. 

Probes based on the human FCTRX nucleotide sequences can be used to detect 
transcripts or genomic sequences encoding the same or homologous proteins. In various 
embodiments, the probe further comprises a label group attached thereto, e.g. the label group can 
be a radioisotope, a fluorescent compound, an enzyme, or an enzyme co-factor. Such probes can 
be used as a part of a diagnostic test kit for identifying cells or tissues which mis-express an 
FCTRX protein, such as by measuring a level of an FCTRX-encoding nucleic acid in a sample of 
cells from a subject, e.g., by detecting FCTRX mRNA levels or determining whether a genomic 
FCTRX gene has been mutated or deleted. 

"A polypeptide having a biologically-active portion of an FCTRX polypeptide" refers to 

polypeptides exhibiting activity similar, but not necessarily identical to, an activity of a 
polypeptide of the invention, including mature forms, as measured in a particular biological 
assay, with or without dose dependency. A nucleic acid fragment encoding a "biologically- 
active portion of FCTRX" can be prepared by isolating a portion of SEQ ID NOS:1, 3, 5, 7, 9, 
10, 11, 12, 14, 16, 18, 20, 22, and 24, that encodes a polypeptide having an FCTRX biological 
activity (the biological activities of the FCTRX proteins are described below), expressing the 
encoded portion of FCTRX protein (e.g., by recombinant expression in vitro) and assessing the 
activity of the encoded portion of FCTRX. 

FCTRX Nucleic Acid and Polypeptide Variants 

The invention further encompasses nucleic acid molecules that differ from the nucleotide 
sequences shown in SEQ ID NOS:1, 3, 5, 7, 9, 10, 11, 12, 14, 16, 18, 20, 22, and 24, due to 
degeneracy of the genetic code and thus encode the same FCTRX proteins as those encoded by the 
nucleotide sequences shown in SEQ ID NOS:1, 3, 5, 7, 9, 10, 11, 12, 14, 16, 18, 20, 22, and 
24. In another embodiment, an isolated nucleic acid molecule of the invention has a nucleotide 
sequence encoding a protein having an amino acid sequence shown in SEQ ID NOS:2, 4, 6, 8, 
13, 15, 17, 19, 21, 23, and 25. 
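
Degeneracy of the genetic code, as referred to above, means that distinct nucleotide sequences can encode one and the same protein. The Python sketch below builds the standard genetic code from its conventional compact form (bases ordered T, C, A, G) and checks whether two different coding sequences translate identically. The function names and the two short coding sequences are hypothetical and are not taken from any SEQ ID NO.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in T, C, A, G order.
_BASES = "TCAG"
_AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(_BASES, repeat=3), _AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate frame 0 of dna, stopping at the first stop codon."""
    dna = dna.upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def differ_only_by_degeneracy(dna_a: str, dna_b: str) -> bool:
    """True if the sequences differ yet encode the same protein."""
    return dna_a.upper() != dna_b.upper() and translate(dna_a) == translate(dna_b)


# Two hypothetical coding sequences that both encode Met-Ala-Lys.
print(translate("ATGGCTAAATAA"))                                  # MAK
print(differ_only_by_degeneracy("ATGGCTAAATAA", "ATGGCCAAGTGA"))  # True
```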

In addition to the human FCTRX nucleotide sequences shown in SEQ ID NOS:1, 3, 5, 7, 
9, 10, 11, 12, 14, 16, 18, 20, 22, and 24, it will be appreciated by those skilled in the art that 
DNA sequence polymorphisms that lead to changes in the amino acid sequences of the FCTRX 
polypeptides may exist within a population (e.g., the human population). Such genetic 
polymorphism in the FCTRX genes may exist among individuals within a population due to 
natural allelic variation. As used herein, the terms "gene" and "recombinant gene" refer to 
nucleic acid molecules comprising an open reading frame (ORF) encoding an FCTRX protein, 
preferably a vertebrate FCTRX protein. Such natural allelic variations can typically result in 
1-5% variance in the nucleotide sequence of the FCTRX genes. Any and all such nucleotide 
variations and resulting amino acid polymorphisms in the FCTRX polypeptides, which are the 
result of natural allelic variation and that do not alter the functional activity of the FCTRX 
polypeptides, are intended to be within the scope of the invention. 

Moreover, nucleic acid molecules encoding FCTRX proteins from other species, and thus 
that have a nucleotide sequence that differs from the human sequence of SEQ ID NOS:1, 3, 5, 7, 
9, 10, 11, 12, 14, 16, 18, 20, 22, and 24, are intended to be within the scope of the invention. 
Nucleic acid molecules corresponding to natural allelic variants and homologues of the FCTRX 
cDNAs of the invention can be isolated based on their homology to the human FCTRX nucleic 
acids disclosed herein using the human cDNAs, or a portion thereof, as a hybridization probe 
according to standard hybridization techniques under stringent hybridization conditions. 

Accordingly, in another embodiment, an isolated nucleic acid molecule of the invention 
is at least 6 nucleotides in length and hybridizes under stringent conditions to the nucleic acid 
molecule comprising the nucleotide sequence of SEQ ID NOS:1, 3, 5, 7, 9, 10, 11, 12, 14, 16, 
18, 20, 22, and 24. In another embodiment, the nucleic acid is at least 10, 25, 50, 100, 250, 500, 
750, 1000, 1500, or 2000 or more nucleotides in length. In yet another embodiment, an isolated 
nucleic acid molecule of the invention hybridizes to the coding region. As used herein, the term 
"hybridizes under stringent conditions" is intended to describe conditions for hybridization and 
washing under which nucleotide sequences at least 60% homologous to each other typically 
remain hybridized to each other. 

Homologs (i.e., nucleic acids encoding FCTRX proteins derived from species other than 
human) or other related sequences (e.g., paralogs) can be obtained by low, moderate or high 
stringency hybridization with all or a portion of the particular human sequence as a probe using 
methods well known in the art for nucleic acid hybridization and cloning. 

As used herein, the phrase "stringent hybridization conditions" refers to conditions under 
which a probe, primer or oligonucleotide will hybridize to its target sequence, but to no other 
sequences. Stringent conditions are sequence-dependent and will be different in different 
circumstances. Longer sequences hybridize specifically at higher temperatures than shorter 
sequences. Generally, stringent conditions are selected to be about 5°C lower than the thermal 
melting point (Tm) for the specific sequence at a defined ionic strength and pH. The Tm is the 
temperature (under defined ionic strength, pH and nucleic acid concentration) at which 50% of 
the probes complementary to the target sequence hybridize to the target sequence at equilibrium. 
Since the target sequences are generally present in excess, at Tm, 50% of the probes are occupied 
at equilibrium. Typically, stringent conditions will be those in which the salt concentration is 
less than about 1.0 M sodium ion, typically about 0.01 to 1.0 M sodium ion (or other salts) at 
pH 7.0 to 8.3 and the temperature is at least about 30°C for short probes, primers or 
oligonucleotides (e.g., 10 nt to 50 nt) and at least about 60°C for longer probes, primers and 
oligonucleotides. Stringent conditions may also be achieved with the addition of destabilizing 
agents, such as formamide. 
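
The relationship between Tm and a stringent hybridization temperature (about 5°C below Tm) can be illustrated numerically. The disclosure does not prescribe how Tm is to be estimated; the sketch below assumes the Wallace rule of thumb, Tm ≈ 2°C × (A+T) + 4°C × (G+C), which is a rough approximation intended only for short oligonucleotides and which ignores the salt, pH and concentration dependence noted above. The function names and the example oligonucleotide are hypothetical.

```python
def wallace_tm(oligo: str) -> float:
    """Rough Tm estimate in degrees C for a short DNA oligonucleotide.

    Assumption: Wallace rule, 2 C per A/T and 4 C per G/C. This is a
    swapped-in approximation, not a method taken from the disclosure.
    """
    oligo = oligo.upper()
    at = oligo.count("A") + oligo.count("T")
    gc = oligo.count("G") + oligo.count("C")
    if at + gc != len(oligo) or not oligo:
        raise ValueError("oligo must be a non-empty string of A, C, G, T")
    return 2.0 * at + 4.0 * gc


def stringent_temperature(oligo: str, offset_c: float = 5.0) -> float:
    """Temperature about offset_c degrees below the estimated Tm."""
    return wallace_tm(oligo) - offset_c


# Hypothetical 16-mer with 8 A/T and 8 G/C.
probe = "ATGCATGCATGCATGC"
print(wallace_tm(probe))             # 48.0
print(stringent_temperature(probe))  # 43.0
```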

Stringent conditions are known to those skilled in the art and can be found in Ausubel, et 
al., (eds.), Current Protocols in Molecular Biology, John Wiley & Sons, N.Y. (1989), 
6.3.1-6.3.6. Preferably, the conditions are such that sequences at least about 65%, 70%, 75%, 
85%, 90%, 95%, 98%, or 99% homologous to each other typically remain hybridized to each 
other. A non-limiting example of stringent hybridization conditions is hybridization in a high 
salt buffer comprising 6X SSC, 50 mM Tris-HCl (pH 7.5), 1 mM EDTA, 0.02% PVP, 0.02% 
Ficoll, 0.02% BSA, and 500 mg/ml denatured salmon sperm DNA at 65°C, followed by one or 
more washes in 0.2X SSC, 0.01% BSA at 50°C. An isolated nucleic acid molecule of the 
invention that hybridizes under stringent conditions to the sequences of SEQ ID NOS:1, 3, 5, 7, 
9, 10, 11, 12, 14, 16, 18, 20, 22, and 24, corresponds to a naturally-occurring nucleic acid 
molecule. As used herein, a "naturally-occurring" nucleic acid molecule refers to an RNA or 
DNA molecule having a nucleotide sequence that occurs in nature (e.g., encodes a natural 
protein). 

In a second embodiment, a nucleic acid sequence that is hybridizable to the nucleic acid 
molecule comprising the nucleotide sequence of SEQ ID NOS:1, 3, 5, 7, 9, 10, 11, 12, 14, 16, 
18, 20, 22, and 24, or fragments, analogs or derivatives thereof, under conditions of moderate 
stringency is provided. A non-limiting example of moderate stringency hybridization conditions 
is hybridization in 6X SSC, 5X Denhardt's solution, 0.5% SDS and 100 mg/ml denatured 
salmon sperm DNA at 55°C, followed by one or more washes in 1X SSC, 0.1% SDS at 37°C. 
Other conditions of moderate stringency that may be used are well-known within the art. See, 
e.g., Ausubel, et al., (eds.), 1993, Current Protocols in Molecular Biology, John Wiley & 
Sons, NY, and Kriegler, 1990, Gene Transfer and Expression, A Laboratory Manual, 
Stockton Press, NY. 

In a third embodiment, a nucleic acid that is hybridizable to the nucleic acid molecule 
comprising the nucleotide sequences of SEQ ID NOS:1, 3, 5, 7, 9, 10, 11, 12, 14, 16, 18, 20, 22, 
and 24, or fragments, analogs or derivatives thereof, under conditions of low stringency, is 
provided. A non-limiting example of low stringency hybridization conditions is hybridization 
in 35% formamide, 5X SSC, 50 mM Tris-HCl (pH 7.5), 5 mM EDTA, 0.02% PVP, 0.02% 
Ficoll, 0.2% BSA, 100 mg/ml denatured salmon sperm DNA, 10% (wt/vol) dextran sulfate at 
40°C, followed by one or more washes in 2X SSC, 25 mM Tris-HCl (pH 7.4), 5 mM EDTA, and 
0.1% SDS at 50°C. Other conditions of low stringency that may be used are well known in the 
art (e.g., as employed for cross-species hybridizations). See, e.g., Ausubel, et al., (eds.), 1993, 
Current Protocols in Molecular Biology, John Wiley & Sons, NY, and Kriegler, 1990, 
Gene Transfer and Expression, A Laboratory Manual, Stockton Press, NY; Shilo and 
Weinberg, 1981, Proc Natl Acad Sci USA 78: 6789-6792. 

Conservative Mutations 

In addition to naturally-occurring allelic variants of FCTRX sequences that may exist in 
the population, the skilled artisan will further appreciate that changes can be introduced by 
mutation into the nucleotide sequences of SEQ ID NOS:1, 3, 5, 7, 9, 10, 11, 12, 14, 16, 18, 
20, 22, and 24, thereby leading to changes in the amino acid sequences of the encoded FCTRX 
proteins, without altering the functional ability of said FCTRX proteins. For example, nucleotide 
substitutions leading to amino acid substitutions at "non-essential" amino acid residues can be 
made in the sequence of SEQ ID NOS:2, 4, 6, 8, 13, 15, 17, 19, 21, 23, and 25. A 
"non-essential" amino acid residue is a residue that can be altered from the wild-type sequences 
of the FCTRX proteins without altering their biological activity, whereas an "essential" amino 
acid residue is required for such biological activity. For example, amino acid residues that are 
conserved among the FCTRX proteins of the invention are predicted to be particularly non- 
amenable to alteration. Amino acids for which conservative substitutions can be made are well- 
known within the art. 

Another aspect of the invention pertains to nucleic acid molecules encoding FCTRX 
proteins that contain changes in amino acid residues that are not essential for activity. Such 
FCTRX proteins differ in amino acid sequence from SEQ ID NOS:2, 4, 6, 8, 13, 15, 17, 19, 21, 
23, and 25, yet retain biological activity. In one embodiment, the isolated nucleic acid molecule 
comprises a nucleotide sequence encoding a protein, wherein the protein comprises an amino 
acid sequence at least about 45% homologous to the amino acid sequences of SEQ ID NOS:2, 4, 
6, 8, 13, 15, 17, 19, 21, 23, and 25. Preferably, the protein encoded by the nucleic acid molecule 
is at least about 60% homologous to SEQ ID NOS:2, 4, 6, 8, 13, 15, 17, 19, 21, 23, and 25; more 
preferably at least about 70% homologous to SEQ ID NOS:2, 4, 6, 8, 13, 15, 17, 19, 21, 23, and 
25; still more preferably at least about 80% homologous to SEQ ID NOS:2, 4, 6, 8, 13, 15, 17, 
19, 21, 23, and 25; even more preferably at least about 90% homologous to SEQ ID NOS:2, 4, 6, 
8, 13, 15, 17, 19, 21, 23, and 25; and most preferably at least about 95% homologous to SEQ ID 
NOS:2, 4, 6, 8, 13, 15, 17, 19, 21, 23, and 25. 

An isolated nucleic acid molecule encoding an FCTRX protein homologous to the protein 
of SEQ ID NOS:2, 4, 6, 8, 13, 15, 17, 19, 21, 23, and 25, can be created by introducing one or 
more nucleotide substitutions, additions or deletions into the nucleotide sequence of SEQ ID 
NOS:1, 3, 5, 7, 9, 10, 11, 12, 14, 16, 18, 20, 22, and 24, such that one or more amino acid 
substitutions, additions or deletions are introduced into the encoded protein. 

Mutations can be introduced into SEQ ID NOS:2, 4, 6, 8, 13, 15, 17, 19, 21, 23, and 25, 
by standard techniques, such as site-directed mutagenesis and PCR-mediated mutagenesis. 
Preferably, conservative amino acid substitutions are made at one or more predicted, 
non-essential amino acid residues. A "conservative amino acid substitution" is one in which the 
amino acid residue is replaced with an amino acid residue having a similar side chain. Families 
of amino acid residues having similar side chains have been defined within the art. These 
families include amino acids with basic side chains (e.g., lysine, arginine, histidine), acidic side 
chains (e.g., aspartic acid, glutamic acid), uncharged polar side chains (e.g., glycine, asparagine, 
glutamine, serine, threonine, tyrosine, cysteine), nonpolar side chains (e.g., alanine, valine, 
leucine, isoleucine, proline, phenylalanine, methionine, tryptophan), beta-branched side chains 
(e.g., threonine, valine, isoleucine) and aromatic side chains (e.g., tyrosine, phenylalanine, 
tryptophan, histidine). Thus, a predicted non-essential amino acid residue in the FCTRX protein 
is replaced with another amino acid residue from the same side chain family. Alternatively, in 
another embodiment, mutations can be introduced randomly along all or part of an FCTRX 
coding sequence, such as by saturation mutagenesis, and the resultant mutants can be screened 
for FCTRX biological activity to identify mutants that retain activity. Following mutagenesis of 
SEQ ID NOS:2, 4, 6, 8, 13, 15, 17, 19, 21, 23, and 25, the encoded protein can be expressed by 
any recombinant technology known in the art and the activity of the protein can be determined. 

The relatedness of amino acid families may also be determined based on side chain 
interactions. Substituted amino acids may be fully conserved "strong" residues or fully 
conserved "weak" residues. The "strong" group of conserved amino acid residues may be any 
one of the following groups: STA, NEQK, NHQK, NDEQ, QHRK, MILV, MILF, HY, FYW, 
wherein the single letter amino acid codes are grouped by those amino acids that may be 
substituted for each other. Likewise, the "weak" group of conserved residues may be any one of 
the following: CSA, ATV, SAG, STNK, STPA, SGND, SNDEQK, NDEQHK, NEQHRK, 
VLIM, HFY, wherein the letters within each group represent the single letter amino acid code. 
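
The "strong" and "weak" groups listed above lend themselves to a simple lookup. The following Python sketch classifies a single-residue substitution according to whether both residues fall within one of the recited groups. The group strings are copied from the text; the function name and the returned labels are hypothetical, and giving "strong" precedence over "weak" when both apply is an assumption made for the illustration.

```python
STRONG_GROUPS = (
    "STA", "NEQK", "NHQK", "NDEQ", "QHRK", "MILV", "MILF", "HY", "FYW",
)
WEAK_GROUPS = (
    "CSA", "ATV", "SAG", "STNK", "STPA", "SGND", "SNDEQK", "NDEQHK",
    "NEQHRK", "VLIM", "HFY",
)


def classify_substitution(original: str, replacement: str) -> str:
    """Classify a one-letter amino acid substitution.

    Returns "identical", "strong", "weak" or "non-conservative".
    """
    a, b = original.upper(), replacement.upper()
    if len(a) != 1 or len(b) != 1:
        raise ValueError("expected single-letter amino acid codes")
    if a == b:
        return "identical"
    if any(a in group and b in group for group in STRONG_GROUPS):
        return "strong"
    if any(a in group and b in group for group in WEAK_GROUPS):
        return "weak"
    return "non-conservative"


print(classify_substitution("L", "I"))  # strong (group MILV)
print(classify_substitution("S", "G"))  # weak (group SAG)
print(classify_substitution("W", "D"))  # non-conservative
```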

In one embodiment, a mutant FCTRX protein can be assayed for (i) the ability to form 
protein:protein interactions with other FCTRX proteins, other cell-surface proteins, or 
biologically-active portions thereof; (ii) complex formation between a mutant FCTRX protein 
and an FCTRX ligand; or (iii) the ability of a mutant FCTRX protein to bind to an intracellular 
target protein or biologically-active portion thereof (e.g., avidin proteins). 

In yet another embodiment, a mutant FCTRX protein can be assayed for the ability to 
regulate a specific biological function (e.g., regulation of insulin release). 

Antisense Nucleic Acids 

Another aspect of the invention pertains to isolated antisense nucleic acid molecules that 
are hybridizable to or complementary to the nucleic acid molecule comprising the nucleotide 
sequence of SEQ ID NOS:1, 3, 5, 7, 9, 10, 11, 12, 14, 16, 18, 20, 22, and 24, or fragments, 
analogs or derivatives thereof. An "antisense" nucleic acid comprises a nucleotide sequence that 
is complementary to a "sense" nucleic acid encoding a protein (e.g., complementary to the 
coding strand of a double-stranded cDNA molecule or complementary to an mRNA sequence). 
In specific aspects, antisense nucleic acid molecules are provided that comprise a sequence 
complementary to at least about 10, 25, 50, 100, 250 or 500 nucleotides or an entire FCTRX 
coding strand, or to only a portion thereof. Nucleic acid molecules encoding fragments, 
homologs, derivatives and analogs of an FCTRX protein of SEQ ID NOS:2, 4, 6, 8, 13, 15, 17, 
19, 21, 23, and 25; or antisense nucleic acids complementary to an FCTRX nucleic acid 
sequence of SEQ ID NOS:1, 3, 5, 7, 9, 10, 11, 12, 14, 16, 18, 20, 22, and 24, are additionally 
provided. 

In one embodiment, an antisense nucleic acid molecule is antisense to a "coding region" 
of the coding strand of a nucleotide sequence encoding an FCTRX protein. The term "coding 
region" refers to the region of the nucleotide sequence comprising codons which are translated 
into amino acid residues. In another embodiment, the antisense nucleic acid molecule is 
antisense to a "noncoding region" of the coding strand of a nucleotide sequence encoding the 
FCTRX protein. The term "noncoding region" refers to 5' and 3' sequences which flank the 
coding region that are not translated into amino acids (i.e., also referred to as 5' and 3' 
untranslated regions). 

Given the coding strand sequences encoding the FCTRX protein disclosed herein, 
antisense nucleic acids of the invention can be designed according to the rules of Watson and 
Crick or Hoogsteen base pairing. The antisense nucleic acid molecule can be complementary to 
the entire coding region of FCTRX mRNA, but more preferably is an oligonucleotide that is 
antisense to only a portion of the coding or noncoding region of FCTRX mRNA. For example, 
the antisense oligonucleotide can be complementary to the region surrounding the translation 
start site of FCTRX mRNA. An antisense oligonucleotide can be, for example, about 5, 10, 15, 
20, 25, 30, 35, 40, 45 or 50 nucleotides in length. An antisense nucleic acid of the invention can 
be constructed using chemical synthesis or enzymatic ligation reactions using procedures known 
in the art. For example, an antisense nucleic acid (e.g., an antisense oligonucleotide) can be 
chemically synthesized using naturally-occurring nucleotides or variously modified nucleotides 
designed to increase the biological stability of the molecules or to increase the physical stability 
of the duplex formed between the antisense and sense nucleic acids (e.g., phosphorothioate 
derivatives and acridine substituted nucleotides can be used). 
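
As a minimal sketch of the design rule described above, the following Python derives an antisense oligonucleotide complementary to the region surrounding a translation start site. The function name, the centering of the window on the start codon, and the toy mRNA are hypothetical choices made for illustration. The oligonucleotide is written as unmodified DNA, 5' to 3', using Watson-Crick pairing only; modified nucleotides and Hoogsteen pairing are not modeled.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def antisense_oligo(mrna: str, start_codon_index: int, length: int = 20) -> str:
    """Return a DNA oligo antisense to the region around the start codon.

    mrna              -- sense sequence (U or T accepted), 5' to 3'
    start_codon_index -- 0-based index of the A of the AUG start codon
    length            -- oligo length in nucleotides
    """
    sense = mrna.upper().replace("U", "T")
    if sense[start_codon_index:start_codon_index + 3] != "ATG":
        raise ValueError("no start codon at start_codon_index")
    if not 0 < length <= len(sense):
        raise ValueError("length must be between 1 and the mRNA length")
    # Center the target window on the middle base of the start codon,
    # then clamp it so that it stays inside the sequence.
    left = start_codon_index + 1 - length // 2
    left = max(0, min(left, len(sense) - length))
    target = sense[left:left + length]
    # Antisense strand = reverse complement of the sense target.
    return target.translate(_COMPLEMENT)[::-1]


# Toy 18-nt mRNA with AUG at index 6, for illustration only.
print(antisense_oligo("GGCACCAUGGCUAAAGGC", start_codon_index=6, length=10))
# AGCCATGGTG
```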

Examples of modified nucleotides that can be used to generate the antisense nucleic acid 
include: 5-fluorouracil, 5-bromouracil, 5-chlorouracil, 5-iodouracil, hypoxanthine, xanthine, 
4-acetylcytosine, 5-(carboxyhydroxylmethyl) uracil, 5-carboxymethylaminomethyl- 
2-thiouridine, 5-carboxymethylaminomethyluracil, dihydrouracil, beta-D-galactosylqueosine, 
inosine, N6-isopentenyladenine, 1-methylguanine, 1-methylinosine, 2,2-dimethylguanine, 
2-methyladenine, 2-methylguanine, 3-methylcytosine, 5-methylcytosine, N6-adenine, 
7-methylguanine, 5-methylaminomethyluracil, 5-methoxyaminomethyl-2-thiouracil, 
beta-D-mannosylqueosine, 5'-methoxycarboxymethyluracil, 5-methoxyuracil, 
2-methylthio-N6-isopentenyladenine, uracil-5-oxyacetic acid (v), wybutoxosine, pseudouracil, 
queosine, 2-thiocytosine, 5-methyl-2-thiouracil, 2-thiouracil, 4-thiouracil, 5-methyluracil, 
uracil-5-oxyacetic acid methylester, uracil-5-oxyacetic acid (v), 5-methyl-2-thiouracil, 
3-(3-amino-3-N-2-carboxypropyl) uracil, (acp3)w, and 2,6-diaminopurine. Alternatively, the 
antisense nucleic acid can be produced biologically using an expression vector into which a 
nucleic acid has been subcloned in an antisense orientation (i.e., RNA transcribed from the 
inserted nucleic acid will be of an antisense orientation to a target nucleic acid of interest, 
described further in the following subsection). 

The antisense nucleic acid molecules of the invention are typically administered to a 
subject or generated in situ such that they hybridize with or bind to cellular mRNA and/or 
genomic DNA encoding an FCTRX protein to thereby inhibit expression of the protein (e.g., by 
inhibiting transcription and/or translation). The hybridization can be by conventional nucleotide 
complementarity to form a stable duplex, or, for example, in the case of an antisense nucleic acid 
molecule that binds to DNA duplexes, through specific interactions in the major groove of the 
double helix. An example of a route of administration of antisense nucleic acid molecules of the 
invention includes direct injection at a tissue site. Alternatively, antisense nucleic acid 
molecules can be modified to target selected cells and then administered systemically. For 
example, for systemic administration, antisense molecules can be modified such that they 
specifically bind to receptors or antigens expressed on a selected cell surface (e.g., by linking the 
antisense nucleic acid molecules to peptides or antibodies that bind to cell surface receptors or 
antigens). The antisense nucleic acid molecules can also be delivered to cells using the vectors 
described herein. To achieve sufficient intracellular concentrations of the antisense nucleic acid molecules, vector constructs in which the
antisense nucleic acid molecule is placed under the control of a strong pol II or pol III promoter 
are preferred. 

In yet another embodiment, the antisense nucleic acid molecule of the invention is an 
α-anomeric nucleic acid molecule. An α-anomeric nucleic acid molecule forms specific
double-stranded hybrids with complementary RNA in which, contrary to the usual β-units, the
strands run parallel to each other. See, e.g., Gaultier, et al., 1987. Nucl. Acids Res. 15:
6625-6641. The antisense nucleic acid molecule can also comprise a 2'-O-methylribonucleotide
(see, e.g., Inoue, et al., 1987. Nucl. Acids Res. 15: 6131-6148) or a chimeric RNA-DNA analogue
(see, e.g., Inoue, et al., 1987. FEBS Lett. 215: 327-330).

Ribozymes and PNA Moieties 

Nucleic acid modifications include, by way of non-limiting example, modified bases, and 
nucleic acids whose sugar phosphate backbones are modified or derivatized. These 
modifications are carried out at least in part to enhance the chemical stability of the modified 
nucleic acid, such that they may be used, for example, as antisense binding nucleic acids in 
therapeutic applications in a subject. 

In one embodiment, an antisense nucleic acid of the invention is a ribozyme. Ribozymes 
are catalytic RNA molecules with ribonuclease activity that are capable of cleaving a 
single-stranded nucleic acid, such as an mRNA, to which they have a complementary region. 
Thus, ribozymes (e.g., hammerhead ribozymes as described in Haselhoff and Gerlach 1988.
Nature 334: 585-591) can be used to catalytically cleave FCTRX mRNA transcripts to thereby 
inhibit translation of FCTRX mRNA. A ribozyme having specificity for an FCTRX-encoding 
nucleic acid can be designed based upon the nucleotide sequence of an FCTRX cDNA disclosed 
herein (i.e., SEQ ID NOS:1, 3, 5, 7, 9, 10, 11, 12, 14, 16, 18, 20, 22, and 24). For example, a
derivative of a Tetrahymena L-19 IVS RNA can be constructed in which the nucleotide sequence 
of the active site is complementary to the nucleotide sequence to be cleaved in an 
FCTRX-encoding mRNA. See, e.g., U.S. Patent 4,987,071 to Cech, et al. and U.S. Patent 
5,116,742 to Cech, et al. FCTRX mRNA can also be used to select a catalytic RNA having a
specific ribonuclease activity from a pool of RNA molecules. See, e.g., Bartel et al., (1993)
Science 261:1411-1418. 

Alternatively, FCTRX gene expression can be inhibited by targeting nucleotide 
sequences complementary to the regulatory region of the FCTRX nucleic acid (e.g., the FCTRX 
promoter and/or enhancers) to form triple helical structures that prevent transcription of the 
FCTRX gene in target cells. See, e.g., Helene, 1991. Anticancer Drug Des. 6: 569-84; Helene, et
al. 1992. Ann. N.Y. Acad. Sci. 660: 27-36; Maher, 1992. Bioessays 14: 807-15.

In various embodiments, the FCTRX nucleic acids can be modified at the base moiety, 
sugar moiety or phosphate backbone to improve, e.g., the stability, hybridization, or solubility of 
the molecule. For example, the deoxyribose phosphate backbone of the nucleic acids can be 
modified to generate peptide nucleic acids. See, e.g., Hyrup, et al., 1996. Bioorg Med Chem 4:
5-23. As used herein, the terms "peptide nucleic acids" or "PNAs" refer to nucleic acid mimics 
(e.g., DNA mimics) in which the deoxyribose phosphate backbone is replaced by a 
pseudopeptide backbone and only the four natural nucleobases are retained. The neutral 
backbone of PNAs has been shown to allow for specific hybridization to DNA and RNA under 
conditions of low ionic strength. The synthesis of PNA oligomers can be performed using 
standard solid phase peptide synthesis protocols as described in Hyrup, et al., 1996. supra;
Perry-O'Keefe, et al., 1996. Proc. Natl. Acad. Sci. USA 93: 14670-14675.

PNAs of FCTRX can be used in therapeutic and diagnostic applications. For example, 
PNAs can be used as antisense or antigene agents for sequence-specific modulation of gene 
expression by, e.g., inducing transcription or translation arrest or inhibiting replication. PNAs of 
FCTRX can also be used, for example, in the analysis of single base pair mutations in a gene 
(e.g., PNA directed PCR clamping); as artificial restriction enzymes when used in combination
with other enzymes, e.g., S1 nucleases (see, Hyrup, et al., 1996. supra); or as probes or primers
for DNA sequence and hybridization (see, Hyrup, et al., 1996. supra; Perry-O'Keefe, et al.,
1996. supra).

In another embodiment, PNAs of FCTRX can be modified, e.g., to enhance their stability 
or cellular uptake, by attaching lipophilic or other helper groups to PNA, by the formation of 
PNA-DNA chimeras, or by the use of liposomes or other techniques of drug delivery known in 
the art. For example, PNA-DNA chimeras of FCTRX can be generated that may combine the 
advantageous properties of PNA and DNA. Such chimeras allow DNA recognition enzymes 
(e.g., RNase H and DNA polymerases) to interact with the DNA portion while the PNA portion 
would provide high binding affinity and specificity. PNA-DNA chimeras can be linked using 
linkers of appropriate lengths selected in terms of base stacking, number of bonds between the
nucleobases, and orientation (see, Hyrup, et al., 1996. supra). The synthesis of PNA-DNA
chimeras can be performed as described in Hyrup, et al., 1996. supra and Finn, et al., 1996. Nucl
Acids Res 24: 3357-3363. For example, a DNA chain can be synthesized on a solid support
using standard phosphoramidite coupling chemistry, and modified nucleoside analogs, e.g.,
5'-(4-methoxytrityl)amino-5'-deoxy-thymidine phosphoramidite, can be used between the PNA
and the 5' end of DNA. See, e.g., Mag, et al., 1989. Nucl Acid Res 17: 5973-5988. PNA
monomers are then coupled in a stepwise manner to produce a chimeric molecule with a 5' PNA
segment and a 3' DNA segment. See, e.g., Finn, et al., 1996. supra. Alternatively, chimeric
molecules can be synthesized with a 5' DNA segment and a 3' PNA segment. See, e.g., Petersen,
et al., 1995. Bioorg. Med. Chem. Lett. 5: 1119-1124.

In other embodiments, the oligonucleotide may include other appended groups such as 
peptides (e.g., for targeting host cell receptors in vivo), or agents facilitating transport across the 
cell membrane (see, e.g., Letsinger, et al., 1989. Proc. Natl. Acad. Sci. U.S.A. 86: 6553-6556;
Lemaitre, et al., 1987. Proc. Natl. Acad. Sci. 84: 648-652; PCT Publication No. WO88/09810) or
the blood-brain barrier (see, e.g., PCT Publication No. WO 89/10134). In addition,
oligonucleotides can be modified with hybridization triggered cleavage agents (see, e.g., Krol, et
al., 1988. BioTechniques 6:958-976) or intercalating agents (see, e.g., Zon, 1988. Pharm. Res. 5:
539-549). To this end, the oligonucleotide may be conjugated to another molecule, e.g., a 
peptide, a hybridization triggered cross-linking agent, a transport agent, a hybridization-triggered 
cleavage agent, and the like. 

FCTRX Polypeptides 

A polypeptide according to the invention includes a polypeptide including the amino acid 
sequence of FCTRX polypeptides whose sequences are provided in SEQ ID NOS:2, 4, 6, 8, 13,
15, 17, 19, 21, 23, and 25. The invention also includes a mutant or variant protein any of whose 
residues may be changed from the corresponding residues shown in SEQ ID NOS:2, 4, 6, 8, 13, 
15, 17, 19, 21, 23, and 25, while still encoding a protein that maintains its FCTRX activities and 
physiological functions, or a functional fragment thereof. 

In general, an FCTRX variant that preserves FCTRX-like function includes any variant in 
which residues at a particular position in the sequence have been substituted by other amino 
acids, and further include the possibility of inserting an additional residue or residues between 
two residues of the parent protein as well as the possibility of deleting one or more residues from 
the parent sequence. Any amino acid substitution, insertion, or deletion is encompassed by the 
invention. In favorable circumstances, the substitution is a conservative substitution as defined
above. 

One aspect of the invention pertains to isolated FCTRX proteins, and biologically-active 
portions thereof, or derivatives, fragments, analogs or homologs thereof. Also provided are 
polypeptide fragments suitable for use as immunogens to raise anti-FCTRX antibodies. In one 
embodiment, native FCTRX proteins can be isolated from cells or tissue sources by an 
appropriate purification scheme using standard protein purification techniques. In another 
embodiment, FCTRX proteins are produced by recombinant DNA techniques. Alternative to 
recombinant expression, an FCTRX protein or polypeptide can be synthesized chemically using 
standard peptide synthesis techniques. 

An "isolated" or "purified" polypeptide or protein or biologically-active portion thereof is 
substantially free of cellular material or other contaminating proteins from the cell or tissue 
source from which the FCTRX protein is derived, or substantially free from chemical precursors 
or other chemicals when chemically synthesized. The language "substantially free of cellular 
material" includes preparations of FCTRX proteins in which the protein is separated from 
cellular components of the cells from which it is isolated or recombinantly-produced. In one 
embodiment, the language "substantially free of cellular material" includes preparations of 
FCTRX proteins having less than about 30% (by dry weight) of non-FCTRX proteins (also 
referred to herein as a "contaminating protein"), more preferably less than about 20% of 
non-FCTRX proteins, still more preferably less than about 10% of non-FCTRX proteins, and 
most preferably less than about 5% of non-FCTRX proteins. When the FCTRX protein or 
biologically-active portion thereof is recombinantly-produced, it is also preferably substantially 
free of culture medium, i.e., culture medium represents less than about 20%, more preferably less 
than about 10%, and most preferably less than about 5% of the volume of the FCTRX protein 
preparation. 

The language "substantially free of chemical precursors or other chemicals" includes 
preparations of FCTRX proteins in which the protein is separated from chemical precursors or 
other chemicals that are involved in the synthesis of the protein. In one embodiment, the 
language "substantially free of chemical precursors or other chemicals" includes preparations of 
FCTRX proteins having less than about 30% (by dry weight) of chemical precursors or 
non-FCTRX chemicals, more preferably less than about 20% chemical precursors or 
non-FCTRX chemicals, still more preferably less than about 10% chemical precursors or 
non-FCTRX chemicals, and most preferably less than about 5% chemical precursors or 
non-FCTRX chemicals. 

Biologically-active portions of FCTRX proteins include peptides comprising amino acid
sequences sufficiently homologous to or derived from the amino acid sequences of the FCTRX 
proteins (e.g., the amino acid sequence shown in SEQ ID NOS:2, 4, 6, 8, 13, 15, 17, 19, 21, 23, 
and 25) that include fewer amino acids than the full-length FCTRX proteins, and exhibit at least 
one activity of an FCTRX protein. Typically, biologically-active portions comprise a domain or 
motif with at least one activity of the FCTRX protein. A biologically-active portion of an 
FCTRX protein can be a polypeptide which is, for example, 10, 25, 50, 100 or more amino acid 
residues in length. 

Moreover, other biologically-active portions, in which other regions of the protein are 
deleted, can be prepared by recombinant techniques and evaluated for one or more of the 
functional activities of a native FCTRX protein. 

In an embodiment, the FCTRX protein has an amino acid sequence shown in SEQ ID 
NOS:2, 4, 6, 8, 13, 15, 17, 19, 21, 23, and 25. In other embodiments, the FCTRX protein is 
substantially homologous to SEQ ID NOS:2, 4, 6, 8, 13, 15, 17, 19, 21, 23, and 25, and retains 
the functional activity of the protein of SEQ ID NOS:2, 4, 6, 8, 13, 15, 17, 19, 21, 23, and 25, yet 
differs in amino acid sequence due to natural allelic variation or mutagenesis, as described in 
detail, below. Accordingly, in another embodiment, the FCTRX protein is a protein that 
comprises an amino acid sequence at least about 45% homologous to the amino acid sequence of 
SEQ ID NOS:2, 4, 6, 8, 13, 15, 17, 19, 21, 23, and 25, and retains the functional activity of the 
FCTRX proteins of SEQ ID NOS:2, 4, 6, 8, 13, 15, 17, 19, 21, 23, and 25. 

Determining Homology Between Two or More Sequences 

To determine the percent homology of two amino acid sequences or of two nucleic acids, 
the sequences are aligned for optimal comparison purposes (e.g., gaps can be introduced in the 
sequence of a first amino acid or nucleic acid sequence for optimal alignment with a second 
amino or nucleic acid sequence). The amino acid residues or nucleotides at corresponding amino 
acid positions or nucleotide positions are then compared. When a position in the first sequence 
is occupied by the same amino acid residue or nucleotide as the corresponding position in the 
second sequence, then the molecules are homologous at that position (i.e., as used herein amino 
acid or nucleic acid "homology" is equivalent to amino acid or nucleic acid "identity"). 

The nucleic acid sequence homology may be determined as the degree of identity 
between two sequences. The homology may be determined using computer programs known in 
the art, such as GAP software provided in the GCG program package. See, Needleman and 
Wunsch, 1970. J Mol Biol 48: 443-453. Using GCG GAP software with the following settings
for nucleic acid sequence comparison: GAP creation penalty of 5.0 and GAP extension penalty 
of 0.3, the coding region of the analogous nucleic acid sequences referred to above exhibits a
degree of identity preferably of at least 70%, 75%, 80%, 85%, 90%, 95%, 98%, or 99%, with the
CDS (encoding) part of the DNA sequence shown in SEQ ID NOS:1, 3, 5, 7, 9, 10, 11, 12, 14,
16, 18, 20, 22, and 24. 
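
By way of non-limiting illustration only, the role of the gap creation and gap extension penalties in a Needleman and Wunsch style global alignment can be sketched as follows. This is not the GCG GAP program. It is a minimal score-only sketch that assumes a gap of length k costs the creation penalty plus k times the extension penalty, and that assumes a match score of 1.0 and a mismatch score of 0.0. The function name, the scoring values and the example sequences are hypothetical.

```python
def global_alignment_score(seq_a, seq_b, match=1.0, mismatch=0.0,
                           gap_creation=5.0, gap_extension=0.3):
    """Best global alignment score with affine gap penalties (score only).

    Assumption: a gap of length k costs gap_creation + k * gap_extension.
    """
    neg_inf = float("-inf")
    n, m = len(seq_a), len(seq_b)
    # diag: alignment ends with seq_a[i-1] paired to seq_b[j-1].
    # gap_b: alignment ends with seq_a[i-1] opposite a gap.
    # gap_a: alignment ends with seq_b[j-1] opposite a gap.
    diag = [[neg_inf] * (m + 1) for _ in range(n + 1)]
    gap_b = [[neg_inf] * (m + 1) for _ in range(n + 1)]
    gap_a = [[neg_inf] * (m + 1) for _ in range(n + 1)]
    diag[0][0] = 0.0
    for i in range(1, n + 1):
        gap_b[i][0] = -(gap_creation + i * gap_extension)
    for j in range(1, m + 1):
        gap_a[0][j] = -(gap_creation + j * gap_extension)

    open_cost = gap_creation + gap_extension  # cost of the first gap position
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            pair = match if seq_a[i - 1] == seq_b[j - 1] else mismatch
            diag[i][j] = pair + max(diag[i - 1][j - 1],
                                    gap_a[i - 1][j - 1],
                                    gap_b[i - 1][j - 1])
            gap_b[i][j] = max(diag[i - 1][j] - open_cost,
                              gap_a[i - 1][j] - open_cost,
                              gap_b[i - 1][j] - gap_extension)
            gap_a[i][j] = max(diag[i][j - 1] - open_cost,
                              gap_b[i][j - 1] - open_cost,
                              gap_a[i][j - 1] - gap_extension)
    return max(diag[n][m], gap_a[n][m], gap_b[n][m])
```

With these assumed values a single gap is costly relative to a match, so the optimal alignment introduces a gap only where the sequence lengths or a long run of matches require it.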

The term "sequence identity" refers to the degree to which two polynucleotide or 
polypeptide sequences are identical on a residue-by-residue basis over a particular region of 
comparison. The term "percentage of sequence identity" is calculated by comparing two 
optimally aligned sequences over that region of comparison, determining the number of positions 
at which the identical nucleic acid base (e.g., A, T, C, G, U, or I, in the case of nucleic acids) 
occurs in both sequences to yield the number of matched positions, dividing the number of 
matched positions by the total number of positions in the region of comparison (i.e., the window 
size), and multiplying the result by 100 to yield the percentage of sequence identity. The term 
"substantial identity" as used herein denotes a characteristic of a polynucleotide sequence, 
wherein the polynucleotide comprises a sequence that has at least 80 percent sequence identity, 
preferably at least 85 percent identity and often 90 to 95 percent sequence identity, more usually 
at least 99 percent sequence identity as compared to a reference sequence over a comparison 
region. 
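
By way of non-limiting illustration only, the calculation of the percentage of sequence identity described above can be sketched as follows. The sketch assumes the two sequences have already been optimally aligned and padded to equal length with a gap character, and that gap columns count toward the window size but never count as matched positions. The function names and example sequences are hypothetical.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Matched positions divided by the window size, multiplied by 100."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    window_size = len(aligned_a)
    if window_size == 0:
        raise ValueError("region of comparison is empty")
    matched = sum(
        1
        for x, y in zip(aligned_a.upper(), aligned_b.upper())
        if x == y and x != gap
    )
    return 100.0 * matched / window_size


def has_substantial_identity(aligned_a: str, aligned_b: str,
                             threshold: float = 80.0) -> bool:
    """True when identity meets the 80 percent figure recited in the text."""
    return percent_identity(aligned_a, aligned_b) >= threshold
```

For example, two aligned ten-residue sequences that differ at one position have nine matched positions in a window of ten, i.e., 90 percent sequence identity.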

Chimeric and Fusion Proteins 

The invention also provides FCTRX chimeric or fusion proteins. As used herein, an 
FCTRX "chimeric protein" or "fusion protein" comprises an FCTRX polypeptide operatively-linked
to a non-FCTRX polypeptide. An "FCTRX polypeptide" refers to a polypeptide having
an amino acid sequence corresponding to an FCTRX protein (SEQ ID NOS:2, 4, 6, 8, 13, 15, 17, 
19, 21, 23, and 25), whereas a "non-FCTRX polypeptide" refers to a polypeptide having an 
amino acid sequence corresponding to a protein that is not substantially homologous to the 
FCTRX protein, e.g., a protein that is different from the FCTRX protein and that is derived from 
the same or a different organism. Within an FCTRX fusion protein the FCTRX polypeptide can 
correspond to all or a portion of an FCTRX protein. In one embodiment, an FCTRX fusion 
protein comprises at least one biologically-active portion of an FCTRX protein. In another 
embodiment, an FCTRX fusion protein comprises at least two biologically-active portions of an 
FCTRX protein. In yet another embodiment, an FCTRX fusion protein comprises at least three 
biologically-active portions of an FCTRX protein. Within the fusion protein, the term 
"operatively-linked" is intended to indicate that the FCTRX polypeptide and the non-FCTRX 
polypeptide are fused in-frame with one another. The non-FCTRX polypeptide can be fused to 
the N-terminus or C-terminus of the FCTRX polypeptide. 

In one embodiment, the fusion protein is a GST-FCTRX fusion protein in which the
FCTRX sequences are fused to the C-terminus of the GST (glutathione S-transferase) sequences. 
Such fusion proteins can facilitate the purification of recombinant FCTRX polypeptides. 
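
By way of non-limiting illustration only, the requirement that the two polypeptides be fused in-frame can be sketched at the level of the coding sequences. The sketch assumes each coding sequence begins in frame and has a length that is a multiple of three, and that the terminal stop codon of the N-terminal partner (e.g., a GST coding sequence) is removed so that translation continues into the C-terminal partner. The function name and the short example sequences are hypothetical.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def fuse_in_frame(n_terminal_cds: str, c_terminal_cds: str) -> str:
    """Join two coding sequences so the second is read in the same frame."""
    n_cds = n_terminal_cds.upper()
    c_cds = c_terminal_cds.upper()
    for label, cds in (("N-terminal", n_cds), ("C-terminal", c_cds)):
        if len(cds) % 3 != 0:
            raise ValueError(f"{label} coding sequence is not a whole number of codons")
    # Drop the terminal stop codon of the N-terminal partner, if present.
    if n_cds[-3:] in STOP_CODONS:
        n_cds = n_cds[:-3]
    # Any remaining in-frame stop codon would truncate the fusion protein.
    for i in range(0, len(n_cds), 3):
        if n_cds[i:i + 3] in STOP_CODONS:
            raise ValueError("internal stop codon in N-terminal coding sequence")
    return n_cds + c_cds
```

Because both inputs are whole numbers of codons, the junction falls on a codon boundary and the reading frame of the C-terminal partner is preserved.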

In another embodiment, the fusion protein is an FCTRX protein containing a 
heterologous signal sequence at its N-terminus. In certain host cells (e.g., mammalian host 
cells), expression and/or secretion of FCTRX can be increased through use of a heterologous 
signal sequence. 

In yet another embodiment, the fusion protein is an FCTRX-immunoglobulin fusion 
protein in which the FCTRX sequences are fused to sequences derived from a member of the 
immunoglobulin protein family. The FCTRX-immunoglobulin fusion proteins of the invention 
can be incorporated into pharmaceutical compositions and administered to a subject to inhibit an 
interaction between an FCTRX ligand and an FCTRX protein on the surface of a cell, to thereby 
suppress FCTRX-mediated signal transduction in vivo. The FCTRX-immunoglobulin fusion 
proteins can be used to affect the bioavailability of an FCTRX cognate ligand. Inhibition of the 
FCTRX ligand/FCTRX interaction may be useful therapeutically both for the treatment of
proliferative and differentiative disorders and for modulating (e.g., promoting or inhibiting)
cell survival. Moreover, the FCTRX-immunoglobulin fusion proteins of the invention can be
used as immunogens to produce anti-FCTRX antibodies in a subject, to purify FCTRX ligands, 
and in screening assays to identify molecules that inhibit the interaction of FCTRX with an 
FCTRX ligand. 

An FCTRX chimeric or fusion protein of the invention can be produced by standard 
recombinant DNA techniques. For example, DNA fragments coding for the different 
polypeptide sequences are ligated together in-frame in accordance with conventional techniques, 
e.g., by employing blunt-ended or stagger-ended termini for ligation, restriction enzyme 
digestion to provide for appropriate termini, filling-in of cohesive ends as appropriate, alkaline 
phosphatase treatment to avoid undesirable joining, and enzymatic ligation. In another 
embodiment, the fusion gene can be synthesized by conventional techniques including automated 
DNA synthesizers. Alternatively, PCR amplification of gene fragments can be carried out using 
anchor primers that give rise to complementary overhangs between two consecutive gene 
fragments that can subsequently be annealed and reamplified to generate a chimeric gene 
sequence (see, e.g., Ausubel, et al. (eds.) CURRENT PROTOCOLS IN MOLECULAR BIOLOGY, John 
Wiley & Sons, 1992). Moreover, many expression vectors are commercially available that 
already encode a fusion moiety (e.g., a GST polypeptide). An FCTRX-encoding nucleic acid 
can be cloned into such an expression vector such that the fusion moiety is linked in-frame to the
FCTRX protein. 

FCTRX Agonists and Antagonists 

The invention also pertains to variants of the FCTRX proteins that function as either 
FCTRX agonists (i.e., mimetics) or as FCTRX antagonists. Variants of the FCTRX protein can
be generated by mutagenesis (e.g., discrete point mutation or truncation of the FCTRX protein).
An agonist of the FCTRX protein can retain substantially the same, or a subset of, the biological 
activities of the naturally occurring form of the FCTRX protein. An antagonist of the FCTRX 
protein can inhibit one or more of the activities of the naturally occurring form of the FCTRX 
protein by, for example, competitively binding to a downstream or upstream member of a 
cellular signaling cascade which includes the FCTRX protein. Thus, specific biological effects 
can be elicited by treatment with a variant of limited function. In one embodiment, treatment of 
a subject with a variant having a subset of the biological activities of the naturally occurring form 
of the protein has fewer side effects in a subject relative to treatment with the naturally occurring 
form of the FCTRX proteins. 

Variants of the FCTRX proteins that function as either FCTRX agonists (i.e., mimetics)
or as FCTRX antagonists can be identified by screening combinatorial libraries of mutants (e.g.,
truncation mutants) of the FCTRX proteins for FCTRX protein agonist or antagonist activity. In 
one embodiment, a variegated library of FCTRX variants is generated by combinatorial 
mutagenesis at the nucleic acid level and is encoded by a variegated gene library. A variegated 
library of FCTRX variants can be produced by, for example, enzymatically ligating a mixture of 
synthetic oligonucleotides into gene sequences such that a degenerate set of potential FCTRX 
sequences is expressible as individual polypeptides, or alternatively, as a set of larger fusion 
proteins (e.g., for phage display) containing the set of FCTRX sequences therein. There are a
variety of methods which can be used to produce libraries of potential FCTRX variants from a 
degenerate oligonucleotide sequence. Chemical synthesis of a degenerate gene sequence can be 
performed in an automatic DNA synthesizer, and the synthetic gene then ligated into an 
appropriate expression vector. Use of a degenerate set of genes allows for the provision, in one 
mixture, of all of the sequences encoding the desired set of potential FCTRX sequences. 
Methods for synthesizing degenerate oligonucleotides are well-known within the art. See, e.g., 
Narang, 1983. Tetrahedron 39: 3; Itakura, et al., 1984. Annu. Rev. Biochem. 53: 323; Itakura, et
al., 1984. Science 198: 1056; Ike, et al., 1983. Nucl. Acids Res. 11: 477.
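
By way of non-limiting illustration only, the set of sequences represented by a degenerate oligonucleotide can be enumerated as sketched below. The sketch assumes the degenerate positions are written with the standard IUPAC nucleotide ambiguity codes. The function name and the example oligonucleotides are hypothetical.

```python
from itertools import product

# Standard IUPAC nucleotide ambiguity codes (DNA alphabet).
IUPAC_BASES = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT", "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG", "N": "ACGT",
}


def expand_degenerate(oligo: str) -> list:
    """Return every unambiguous sequence encoded by a degenerate oligonucleotide."""
    pools = [IUPAC_BASES[base] for base in oligo.upper()]
    return ["".join(combination) for combination in product(*pools)]
```

The number of sequences grows as the product of the degeneracy at each position, so an oligonucleotide with several fully degenerate positions encodes a large library from a single synthesis.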

Polypeptide Libraries

In addition, libraries of fragments of the FCTRX protein coding sequences can be used to 
generate a variegated population of FCTRX fragments for screening and subsequent selection of 
variants of an FCTRX protein. In one embodiment, a library of coding sequence fragments can 
be generated by treating a double stranded PCR fragment of an FCTRX coding sequence with a 
nuclease under conditions wherein nicking occurs only about once per molecule, denaturing the 
double stranded DNA, renaturing the DNA to form double-stranded DNA that can include 
sense/antisense pairs from different nicked products, removing single stranded portions from 
reformed duplexes by treatment with S1 nuclease, and ligating the resulting fragment library into
an expression vector. By this method, expression libraries can be derived which encode
N-terminal and internal fragments of various sizes of the FCTRX proteins. 

Various techniques are known in the art for screening gene products of combinatorial 
libraries made by point mutations or truncation, and for screening cDNA libraries for gene 
products having a selected property. Such techniques are adaptable for rapid screening of the 
gene libraries generated by the combinatorial mutagenesis of FCTRX proteins. The most widely 
used techniques, which are amenable to high throughput analysis, for screening large gene 
libraries typically include cloning the gene library into replicable expression vectors, 
transforming appropriate cells with the resulting library of vectors, and expressing the 
combinatorial genes under conditions in which detection of a desired activity facilitates isolation 
of the vector encoding the gene whose product was detected. Recursive ensemble mutagenesis 
(REM), a new technique that enhances the frequency of functional mutants in the libraries, can 
be used in combination with the screening assays to identify FCTRX variants. See, e.g., Arkin
and Youvan, 1992. Proc. Natl. Acad. Sci. USA 89: 7811-7815; Delagrave, et al., 1993. Protein
Engineering 6: 327-331.

Anti-FCTRX Antibodies 

The invention encompasses antibodies and antibody fragments, such as Fab or (Fab)2, that
bind immunospecifically to any of the FCTRX polypeptides of said invention. 

An isolated FCTRX protein, or a portion or fragment thereof, can be used as an 
immunogen to generate antibodies that bind to FCTRX polypeptides using standard techniques 
for polyclonal and monoclonal antibody preparation. The full-length FCTRX proteins can be 
used or, alternatively, the invention provides antigenic peptide fragments of FCTRX proteins for 
use as immunogens. The antigenic FCTRX peptide comprises at least 4 amino acid residues of
the amino acid sequence shown in SEQ ID NOS:2, 4, 6, 8, 13, 15, 17, 19, 21, 23, and 25, and
encompasses an epitope of FCTRX such that an antibody raised against the peptide forms a
specific immune complex with FCTRX. Preferably, the antigenic peptide comprises at least 6, 8, 
10, 15, 20, or 30 amino acid residues. Longer antigenic peptides are sometimes preferable over 
shorter antigenic peptides, depending on use and according to methods well known to someone 
skilled in the art. 

In certain embodiments of the invention, at least one epitope encompassed by the 
antigenic peptide is a region of FCTRX that is located on the surface of the protein (e.g., a 
hydrophilic region). As a means for targeting antibody production, hydropathy plots showing 
regions of hydrophilicity and hydrophobicity may be generated by any method well known in the 
art, including, for example, the Kyte-Doolittle or the Hopp-Woods methods, either with or
without Fourier transformation (see, e.g., Hopp and Woods, 1981. Proc. Natl. Acad. Sci. USA 78:
3824-3828; Kyte and Doolittle, 1982. J. Mol. Biol. 157: 105-132, each incorporated herein by
reference in their entirety). 
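
By way of non-limiting illustration only, a hydropathy profile of the kind referred to above can be sketched as a sliding-window average of per-residue hydropathy values. The sketch uses the Kyte and Doolittle hydropathy scale, without Fourier transformation. The window length of nine residues is an assumption chosen for illustration, and the function name and example sequences are hypothetical.

```python
# Kyte and Doolittle (1982) hydropathy values; positive values are hydrophobic.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def hydropathy_profile(protein: str, window: int = 9) -> list:
    """Mean hydropathy of each full window along the sequence.

    Returns an empty list when the sequence is shorter than the window.
    A residue outside the twenty standard amino acids raises KeyError.
    """
    if window < 1:
        raise ValueError("window must be at least 1")
    values = [KYTE_DOOLITTLE[residue] for residue in protein.upper()]
    return [
        sum(values[start:start + window]) / window
        for start in range(len(values) - window + 1)
    ]
```

Windows with strongly negative averages indicate hydrophilic stretches, which are candidate surface-exposed regions for selecting antigenic peptides; any such candidate would still need to be confirmed experimentally.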

As disclosed herein, FCTRX protein sequences of SEQ ID NOS:2, 4, 6, 8, 13, 15, 17, 19, 
21, 23, and 25, or derivatives, fragments, analogs or homologs thereof, may be utilized as 
immunogens in the generation of antibodies that immunospecifically-bind these protein 
components. The term "antibody" as used herein refers to immunoglobulin molecules and 
immunologically-active portions of immunoglobulin molecules, i.e., molecules that contain an 
antigen binding site that specifically-binds (immunoreacts with) an antigen, such as FCTRX. 
Such antibodies include, but are not limited to, polyclonal, monoclonal, chimeric, single chain, 
Fab and F(ab')2 fragments, and an Fab expression library. In a specific embodiment, antibodies to
human FCTRX proteins are disclosed. Various procedures known within the art may be used for 
the production of polyclonal or monoclonal antibodies to an FCTRX protein sequence of SEQ ID 
NOS:2, 4, 6, 8, 13, 15, 17, 19, 21, 23, and 25, or a derivative, fragment, analog or homolog 
thereof. Some of these proteins are discussed below. 

For the production of polyclonal antibodies, various suitable host animals (e.g., rabbit, 
goat, mouse or other mammal) may be immunized by injection with the native protein, or a 
synthetic variant thereof, or a derivative of the foregoing. An appropriate immunogenic 
preparation can contain, for example, recombinantly-expressed FCTRX protein or a
chemically-synthesized FCTRX polypeptide. The preparation can further include an adjuvant. Various
adjuvants used to increase the immunological response include, but are not limited to, Freund's 
(complete and incomplete), mineral gels (e.g., aluminum hydroxide), surface active substances 
(e.g., lysolecithin, pluronic polyols, polyanions, peptides, oil emulsions, dinitrophenol, etc.), 
human adjuvants such as Bacille Calmette-Guerin and Corynebacterium parvum, or similar 
immunostimulatory agents. If desired, the antibody molecules directed against FCTRX can be
isolated from the mammal (e.g., from the blood) and further purified by well known techniques,
such as protein A chromatography to obtain the IgG fraction. 

The term "monoclonal antibody" or "monoclonal antibody composition", as used herein, 
refers to a population of antibody molecules that contain only one species of an antigen binding 
site capable of immunoreacting with a particular epitope of FCTRX. A monoclonal antibody 
composition thus typically displays a single binding affinity for a particular FCTRX protein with 
which it immunoreacts. For preparation of monoclonal antibodies directed towards a particular 
FCTRX protein, or derivatives, fragments, analogs or homologs thereof, any technique that 
provides for the production of antibody molecules by continuous cell line culture may be 
utilized. Such techniques include, but are not limited to, the hybridoma technique (see, e.g.,
Kohler & Milstein, 1975. Nature 256: 495-497); the trioma technique; the human B-cell
hybridoma technique (see, e.g., Kozbor, et al., 1983. Immunol. Today 4: 72) and the EBV
hybridoma technique to produce human monoclonal antibodies (see, e.g., Cole, et al., 1985. In:
Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, Inc., pp. 77-96). Human
monoclonal antibodies may be utilized in the practice of the invention and may be produced by
using human hybridomas (see, e.g., Cote, et al., 1983. Proc Natl Acad Sci USA 80: 2026-2030)
or by transforming human B-cells with Epstein Barr Virus in vitro (see, e.g., Cole, et al., 1985.
In: Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, Inc., pp. 77-96). Each of
the above citations is incorporated herein by reference in its entirety.

According to the invention, techniques can be adapted for the production of single-chain 
antibodies specific to an FCTRX protein (see, e.g., U.S. Patent No. 4,946,778). In addition,
methods can be adapted for the construction of Fab expression libraries (see, e.g., Huse, et al.,
1989. Science 246: 1275-1281) to allow rapid and effective identification of monoclonal Fab
fragments with the desired specificity for an FCTRX protein or derivatives, fragments, analogs
or homologs thereof. Non-human antibodies can be "humanized" by techniques well known in
the art. See, e.g., U.S. Patent No. 5,225,539. Antibody fragments that contain the idiotypes to an
FCTRX protein may be produced by techniques known in the art including, but not limited to: (i)
an F(ab')2 fragment produced by pepsin digestion of an antibody molecule; (ii) an Fab fragment
generated by reducing the disulfide bridges of an F(ab')2 fragment; (iii) an Fab fragment generated
by the treatment of the antibody molecule with papain and a reducing agent; and (iv) Fv
fragments. 

Additionally, recombinant anti-FCTRX antibodies, such as chimeric and humanized 
monoclonal antibodies, comprising both human and non-human portions, which can be made 
using standard recombinant DNA techniques, are within the scope of the invention. Such
chimeric and humanized monoclonal antibodies can be produced by recombinant DNA 
techniques known in the art, for example using methods described in International Application 
No. PCT/US86/02269; European Patent Application No. 184,187; European Patent Application 
No. 171,496; European Patent Application No. 173,494; PCT International Publication No. WO 
86/01533; U.S. Patent No. 4,816,567; U.S. Pat. No. 5,225,539; European Patent Application No. 
125,023; Better, et al., 1988. Science 240: 1041-1043; Liu, et al., 1987. Proc. Natl. Acad. Sci.
USA 84: 3439-3443; Liu, et al., 1987. J. Immunol. 139: 3521-3526; Sun, et al., 1987. Proc. Natl.
Acad. Sci. USA 84: 214-218; Nishimura, et al., 1987. Cancer Res. 47: 999-1005; Wood, et al.,
1985. Nature 314: 446-449; Shaw, et al., 1988. J. Natl. Cancer Inst. 80: 1553-1559;
Morrison, 1985. Science 229: 1202-1207; Oi, et al., 1986. BioTechniques 4: 214; Jones, et al.,
1986. Nature 321: 522-525; Verhoeyen, et al., 1988. Science 239: 1534; and Beidler, et al.,
1988. J. Immunol. 141: 4053-4060. Each of the above citations is incorporated herein by
reference in its entirety.

In one embodiment, methods for the screening of antibodies that possess the desired 
specificity include, but are not limited to, enzyme-linked immunosorbent assay (ELISA) and 
other immunologically-mediated techniques known within the art. In a specific embodiment, 
selection of antibodies that are specific to a particular domain of an FCTRX protein is facilitated 
by generation of hybridomas that bind to the fragment of an FCTRX protein possessing such a 
domain. Thus, antibodies that are specific for a desired domain within an FCTRX protein, or 
derivatives, fragments, analogs or homologs thereof, are also provided herein. 

Anti-FCTRX antibodies may be used in methods known within the art relating to the 
localization and/or quantitation of an FCTRX protein (e.g., for use in measuring levels of the
FCTRX protein within appropriate physiological samples, for use in diagnostic methods, for use
in imaging the protein, and the like). In a given embodiment, antibodies for FCTRX proteins, or
derivatives, fragments, analogs or homologs thereof, that contain the antibody-derived binding
domain, are utilized as pharmacologically-active compounds (hereinafter "Therapeutics").

An anti-FCTRX antibody (e.g., monoclonal antibody) can be used to isolate an FCTRX 
polypeptide by standard techniques, such as affinity chromatography or immunoprecipitation. 
An anti-FCTRX antibody can facilitate the purification of natural FCTRX polypeptide from cells 
and of recombinantly-produced FCTRX polypeptide expressed in host cells. Moreover, an 
anti-FCTRX antibody can be used to detect FCTRX protein (e.g., in a cellular lysate or cell 
supernatant) in order to evaluate the abundance and pattern of expression of the FCTRX protein. 
Anti-FCTRX antibodies can be used diagnostically to monitor protein levels in tissue as part of a
clinical testing procedure, e.g., to determine the efficacy of a given treatment
regimen. Detection can be facilitated by coupling (i.e., physically linking) the antibody to a
detectable substance. Examples of detectable substances include various enzymes, prosthetic
groups, fluorescent materials, luminescent materials, bioluminescent materials, and radioactive
materials. Examples of suitable enzymes include horseradish peroxidase, alkaline phosphatase,
β-galactosidase, or acetylcholinesterase; examples of suitable prosthetic group complexes
include streptavidin/biotin and avidin/biotin; examples of suitable fluorescent materials include
umbelliferone, fluorescein, fluorescein isothiocyanate, rhodamine, dichlorotriazinylamine
fluorescein, dansyl chloride or phycoerythrin; an example of a luminescent material is
luminol; examples of bioluminescent materials include luciferase, luciferin, and aequorin; and
examples of suitable radioactive material include ¹²⁵I, ¹³¹I, ³⁵S or ³H.

FCTRX Recombinant Expression Vectors and Host Cells 

Another aspect of the invention pertains to vectors, preferably expression vectors, 
containing a nucleic acid encoding an FCTRX protein, or derivatives, fragments, analogs or
homologs thereof. As used herein, the term "vector" refers to a nucleic acid molecule capable of
transporting another nucleic acid to which it has been linked. One type of vector is a "plasmid",
which refers to a circular double-stranded DNA loop into which additional DNA segments can
be ligated. Another type of vector is a viral vector, wherein additional DNA segments can be
ligated into the viral genome. Certain vectors are capable of autonomous replication in a host
cell into which they are introduced (e.g., bacterial vectors having a bacterial origin of replication
and episomal mammalian vectors). Other vectors (e.g., non-episomal mammalian vectors) are
integrated into the genome of a host cell upon introduction into the host cell, and thereby are
replicated along with the host genome. Moreover, certain vectors are capable of directing the
expression of genes to which they are operatively-linked. Such vectors are referred to herein as
"expression vectors". In general, expression vectors of utility in recombinant DNA techniques
are often in the form of plasmids. In the present specification, "plasmid" and "vector" can be
used interchangeably as the plasmid is the most commonly used form of vector. However, the
invention is intended to include such other forms of expression vectors, such as viral vectors
(e.g., replication-defective retroviruses, adenoviruses and adeno-associated viruses), which serve
equivalent functions.

The recombinant expression vectors of the invention comprise a nucleic acid of the 
invention in a form suitable for expression of the nucleic acid in a host cell, which means that the 
recombinant expression vectors include one or more regulatory sequences, selected on the basis